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Secreted Proteins As Markers For Cell Differentiation 



Field of the invention 

The present invention relates to the use of proteins capable of being secreted across the cell 
membrane as cell-type specific reporters during cell development. In detail, the present 
invention relates to embryonic stem (ES) cells stably transfected with a DNA construct 
comprising a DNA sequence encoding a secreted protein, which is operably linked to a cell- 
and/or development-dependent promoter; a method for preparing such ES cells; a cell culture 
obtainable by culturing said ES cells; a method for the toxicological examination of 
substances using such cell cultures as well as methods for the identification of differentiation 
and growth factors for induction of selectively differentiated cells which can be used as a 
source for tissue grafts; a method for producing transgenic non-human mammals using said 
ES cells; a transgenic non-human mammal obtainable by said method; and a method for 
examining stages of cellular development using cells of such a non-human mammal. The 
present invention also concerns assays to identify culture conditions and substances that 
promote the differentiation into a desired cell type, or to determine the toxicity of a compound 
for certain cell types. Those assays are particularly suited to give indications of possible 
developmental and differentiation disturbances during embryogenesis. 

Background art 

Precursor cells have become a central interest in medical research. On the one hand precursors 
can replace cells that are senescent or damaged by injury or disease and on the other hand 
these cells represent an ideal model for studying development and differentiation and the 
factors influencing these processes. Employing conventional cell lines for these studies has 
the disadvantage that individual cell lines may not be fully representative of the complex 
biology of an intact organism. Moreover, even repeating the tests in multiple cell lines does 
not reproduce or account for the complex interactions among cells and tissues that occurs in 
an organism. 

This is the main reason why toxicity testing in cell lines can not replace conventional animal 
tests. New chemical compounds are constantly being developed and tested on animals. In 
addition to industrial and household chemicals, a number of chemical compositions are 
developed each year for use as pharmaceuticals. Rules regarding the testing of potential 
pharmaceuticals are promulgated by the Food and Drug Administration ("FDA"), which 
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currently requires comprehensive testing of toxicity, mutagenicity, and other effects in at least 
two species before a drug candidate can be entered into human clinical trials. Preclinical 
toxicity testing alone costs some hundreds of thousands of dollars. Despite this huge 
investment, almost one third of all prospective human therapeutics fail in the first phase of 

5 human clinical trials because of unexpected toxicity. It is clear that currently available 
toxicological screening assays do not detect all toxicities associated with human therapy or 
exposure to chemicals in the environment. Better means of screening potential therapeutics or 
chemicals in general for potential toxicity would reduce the cost and uncertainty of 
developing new therapeutics and materials, for example for use in medical devices or in other 

10 devices or goods humans are exposed to every day. 

The detection of teratogenic and/or embryotoxic properties of chemical agents occurs 
presently by determination of the reproduction toxicity of test substances following single or 
multi-administrations to pregnant laboratory mammals and by tests of the embryotoxicity in 

15 the early stages of pregnancy. Furthermore, in vitro tests are performed with mammal 
embryos (Neubert and Merker, de Gruyter, Berlin-New York (1981)) and with embryonic 
organs for teratogenicity tests. These test procedures have however the disadvantage that they 
require the use of a large number of live mammals, in particular rats and mice. In vitro test 
procedures, in which primary cell cultures of limb buds (for example, "Limb Buds", Kochhar, 

20 Teratology 11 (1975), 273-287), or brain parts of embryonic rats (Flint and Orton, Toxicol. 
Appl. Pharmacol. 76 (1984), 383-395) or permanent cell lines of embryonic or adult mammal 
tissue, such as tumor cells of the ovary or embryonic palate cells are employed, do not fulfill, 
strictly speaking, the requirements which are imposed on the teratogenicity tests during the 
embryogenesis, namely giving indications of possible disgenesis or developmental 

25 disturbances. 

Efforts have been made for a couple of years to employ permanent cultures of 
totipotent/pluripotent embryonic stem cells (ES cells) for the detection of embryotoxic and 
mutagenic substances; see, e.g. Laschinski et al., Reproductive Toxicol. 5 (1991), 57-65; 
30 Newall and Beedles, Toxicol, in Vitro 8 (1994), 697-701; Sehlmeyer and Wobus, Mutation 
Res. 324(1994), 69-76. 

All those systems make use of the biological properties of the test ES cells or of a reporter 
system that has been introduced into the cells. 
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For example, US-A-6,498,018 describes a method for determming the effect of a biological 
agent by contacting a cell culture with a biological agent. The cell culture contains human 
multipotent CNS neural stem cells that are derived from primary CNS neural tissue and a 
culture medium with preselected growth factors. The read-out is provided by the effect of a 
5 biological agent on the presence or absence on a biological function or property ascribabie to 
the cell culture. A major disadvantage of this system is the fact that particular biological 
functions or properties inherent to a certain culture of cells are difficult to measure and often 
involve the destruction of a large part of the culture in order to obtain enough material for the 
assay. 

10 WO02/086073 discloses a method for the positive selection of neuronal cells differentiated 
from nuclear transfer embryonic stem cells by taking advantage of the neural stem cell marker 
nectin. This method is limited to neural cell types expressing nectin naturally. 
US-A-6,007,993 describes an in vitro test procedure for the detection of chemically-induced 
embryotoxic (for example also teratogenic) effects based on differentiated pluripotent 

15 embryonic stem (ES) cells from the mouse and rat and using embryonic germ (EG) cells 
obtained established from primordial germ cells. Stable transgenic ES or EG stem cell clones 
are constructed, wherein a bacterial reporter gene, LacZ or the luciferase gene, is brought 
under the control of tissue-specific promoters or developmental control genes. Following 
differentiation of the ES cells in the presence of teratogenic substances into the different 

20 germination path derivatives, there occurs a differentiation-dependent expression in the cells, 
due to the activity of the tissue-specific promoters. The activation, repression or modulation 
of these tissue-specific genes is detected based on a reaction depending on the reporter gene 
employed, for example the X-Gal assay. 

WO99/01552 discloses embryonic stem (ES) cells, which are transfected in a stable manner 
25 with a DNA construct encoding a non-cell damaging fluorescent protein and operatively 
linked thereto a cell- or development-dependent promoter. Also disclosed is a method for 
toxicological monitoring of substances using these ES cell cultures. 

Although the above described methods employ semi-quantitative as well as relatively simple 
30 and robust assays, those assays usually involve the destruction of the tested cells. The 
preparation of a cell lysate required by many of the described conventional assay systems 
results in a high intra- as well as inter-assay variability. For repeated measurements during the 
course of differentiation many parallel cultures have to be assayed with the associated 
problems of variability. Laborious preparation of lysates also renders the assay less amenable 
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to high-throughput formats. Using fluorescent reporter proteins can overcome some of those 
problems. However, GFP expression has already shown some toxic effects (Liu et al. 
Biochem. Biophys. Res. Comm. 260 (1999), 712-717). Another disadvantage is the auto- 
fluorescence of aggregates of ES cells (embryoid bodies) which gives a high background in 
5 fluorescence measurements. 

The technical problem of readily monitoring cells differentiating into a certain cell type and 
thereby providing a reliable and highly sensitive test system for toxicity screening, 
differentiation promoting agents and/or appropriate culture conditions, medium supplements, 
10 is achieved by providing the embodiments characterized in the claims, and described further 
below. 



Summary of the invention 

The present invention relates to a method of monitoring cell differentiation comprising 
15 culturing cells capable of differentiating into at least one particular cell type, wherein said 
cells contain at least one recombinant nucleic acid molecule comprising a reporter gene 
encoding a product that is secreted upon cell differentiation, under conditions allowing 
differentiation of the cells, or maintaining a non-human animal comprising such cells under 
appropriate conditions; and determining the amount or activity or amount of the reporter gene 
20 product either within a body fluid of said transgenic non-human animal or the cell culture 
medium or sample thereof. 

The present invention also concerns the mentioned reporter gene constructs for monitoring 
cell differentiation, as well as cells, aggregates of at least one cell type, tissues comprising 
25 those cells or aggregates, organs comprising said cells or tissue, implants and transplants 
comprising said organ, all of which are characterized by comprising such a reporter gene 
construct. 

Furthermore, the present invention relates to a non-human animal and a composition of matter 
30 both characterized by comprising a reporter gene construct, a cell, a cell aggregate, a tissue 
and/or an organ described herein. 
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In addition, the instant invention relates to an array or chip which contains the cells, cell 
aggregates or tissues of the invention attached to a solid support or suspended thereon as well 
as to an apparatus for analyzing said array or chip. 

5 In particular, the present invention relates to test systems to identify substances that influence 
the differentiation of cells into certain cell types- Therefore, the present invention provides a 
method for obtaining and/or profiling a modulator of cell differentiation. This method 
comprises contacting a test sample containing a cell, cell aggregate, tissue, organ or a non- 
human transgenic animal as defined above with the substance to be tested; and then 
10 determining the effect of the test substance on the amount of the reporter gene product or 
activity compared to a control sample or animal. 

The test system provided by the present invention is useful for drug screening purposes. It is 
envisaged to obtain and manufacture drugs which promote or inhibit the formation of said 
15 specific cell types using the method described above, wherein an enhanced or reduced level or 
activity of the reporter gene product is indicative for the drug. An enhanced or reduced level 
or activity of the reporter gene product, respectively, can also be indicative for an agent or 
compound which supports wound healing and/or healing of damaged tissue or toxicity of 
compounds, respectively. 

20 

The present invention additionally relates to a kit useful for conducting the methods described 
above, which contains, for example, a reporter gene construct and/or a cell as defined above, 
and standard compounds, like cell culture media, selection agents, detection agents for the 
reporter molecule and control samples. 

25 

Furthermore, methods of conducting a drug discovery and/or a target discovery business are 
disclosed. Both methods involve providing one or more assay systems described above for the 
identification of modulators of cell differentiation; and/or conducting therapeutic profiling of 
the modulators identified, or further analogs thereof, for efficacy and toxicity in the transgenic 
30 animals provided by the present invention. The modulators identified using the methods 
provided by the invention are then either formulated in a pharmaceutical composition or 
licensed to a third party for further drug development and/or sales. 
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The present invention also relates to modulators of cell differentiation which have been 
identified using the methods of the invention as well as to pharmaceutical compositions 
comprising such modulators for use in the modulation of cell differentiation and the treatment 
of corresponding diseases. 

5 

Furthermore, the present invention relates to the use of a reporter gene construct, a cell, a cell 
aggregate, tissue, organ, implant or transplant, a non-human animal, a composition, an array 
or an apparatus provided herein in drug discovery, pharmacokinetic or pharmacological 
profiling. 

10 

Brief description of the figure 

Figure 1: SEAP expression driven by the cardiac-specific aMHC promoter by cells 
differentiated from ES cells in the presence of retinoic acid. 
ES cells transfected with the a-MHC-SEAP vector (containing the SEAP gene 

15 under the control of the heart-specific aMHC promoter) were differentiated in 

the presence of ail-trans retinoic acid at concentrations indicated. After 13 days 
culture supernatants were assayed for SEAP activity. Shown are mean values of 
duplicate samples from which background SEAP activity was substracted. 
Activities are expressed as percentage of SEAP activity in samples containing 

20 only solvent (DMSO) but no retinoic acid. 

Definitions 

For the purposes of this description, the term "stem cell" can refer to either stem cell or germ 
cell, for example embryonic stem (ES) and germ (EG), respectively. Minimally, a stem cell 

25 has the ability to proliferate and form cells of more than one different phenotype, and is also 
capable of self renewal-either as part of the same culture, or when cultured under different 
conditions. Embryonic stem cells are also typically telomerase positive and OCT-4 positive. 
Telomerase activity can be determined using TRAP activity assay (Kim et al., Science 266 
(1997), 2011), using a commercially available kit (TRAPeze(R) XK Telomerase Detection 

30 Kit, Cat. s7707; Intergen Co., Purchase N.Y.; or TeloTAGGG(TM) Telomerase PCR 
ELISAplus, Cat. 2,013,89; Roche Diagnostics, Indianapolis). hTERT expression can also be 
evaluated at the mRNA level by RT-PCR. The LightCycler TeloTAGGG(TM) hTERT 
quantification kit (Cat. 3,012,344; Roche Diagnostics) is available commercially for research 
purposes. 
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In accordance with the present invention, the term embryonic stem (ES) cell includes any 
multi- or pluripotent stem cell derived from pre-embryonic, embryonic, or fetal tissue at any 
time after fertilization, and having the characteristic of being capable under appropriate 
conditions of producing progeny of several different cell types that are derivatives of all of the 
three germinal layers (endoderm, mesoderm, and ectoderm), according to a standard art- 
accepted test, such as the ability to form a teratoma in 8-12 week old SCID mice. 
"Embryonic germ cells" or "EG cells" are cells derived from primordial germ cells. The term 
"embryonic germ cell" is used to describe cells of the present invention that exhibit an 
embryonic pluripotent cell phenotype. The terms "human embryonic germ cell (EG)" or 
"embryonic germ cell" can be used interchangeably herein to describe mammalian, preferably 
human cells, or cell lines thereof, of the present invention mat exhibit a pluripotent embryonic 
stem cell phenotype as defined herein. Thus, EG cells are capable of differentiation into cells 
of ectodermal, endodermal, and mesodermal germ layers. EG cells can also be characterized 
by the presence or absence of markers associated with specific epitope sites identified by the 
binding of particular antibodies and the absence of certain markers as identified by the lack of 
binding of certain antibodies. 
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"Pluripotent" refers to cells that retain the developmental potential to differentiate into a wide 
range of cell lineages including the germ line. The terms "embryonic stem cell phenotype" 
and "embryonic stem-like cell" also are used interchangeably herein to describe cells that are 
undifferentiated and thus are pluripotent cells. 

Included in the definition of ES cells are embryonic cells of various types, exemplified by 
human embryonic stem cells, described by Thomson et al. (Science 282 (1998), 1145); 
embryonic stem cells from other primates, such as Rhesus stem cells (Thomson et al., Proc. 
Natl. Acad. Sci. USA 92 (1995), 7844), marmoset stem cells (Thomson et al., Biol. Reprod. 
55 (1996), 254) and human embryonic germ (hEG) cells (Shamblott et al., Proc. Natl. Acad. 
Sci. USA 95 (1998), 13726). Other types of pluripotent cells are also included in the term. 
Any cells of mammal origin that are capable of producing progeny that are derivatives of all 
three germinal layers are included, regardless of whether they were derived from embryonic 
tissue, fetal tissue, or other sources. The stem cells employed in accordance with the present 
invention that are preferably (but not always necessarily) karyotypically normal. However, it 
is preferred not to use ES cells that are derived from a malignant source. 
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"Feeder cells" or "feeders" are terms used to describe cells of one type that are co-cultured 
with cells of another type, to provide an environment in which the cells of the second type can 
grow. The feeder cells are optionally from a different species as the cells they are supporting. 
For example, certain types of ES ceils can be supported by primary mouse embryonic 
fibroblasts, immortalized mouse embryonic fibroblasts(such as murine STO cells, e.g., Martin 
and Evans, Proc. Natl. Acad. Sci USA 72 (1975), 1441-1445), or human fibroblast-like cells 
differentiated from human ES cells, as described later in this disclosure. The term "STO cell" 
refers to embryonic fibroblast mouse cells such as are commercially available and include 
those deposited as ATCC CRL 1503. 

The term "embryoid bodies" (EBs) is a term of art synonymous with "aggregate bodies". The 
terms refer to aggregates of differentiated and undifferentiated cells that appear when ES cells 
overgrow in monolayer cultures, or are maintained in suspension cultures. Embryoid bodies 
are a mixture of different cell types, typically from several germ layers, distinguishable by 
morphological criteria; see also infra. As used herein, "embryoid body", "EB"or "EB cells" 
typically refers to a morphological structure comprised of a population of cells, the majority 
of which are derived from embryonic stem (ES) cells that have undergone differentiation. 
Under culture conditions suitable for EB formation (e.g., the removal of Leukemia inhibitory 
factor or other, similar blocking factors), ES cells proliferate and form small mass of cells that 
begin to differentiate. In the first phase of differentiation, usually corresponding to about days 
1-4 of differentiation for humans, the small mass of cells forms a layer of endodermal cells on 
the outer layer, and is considered a "simple embryoid body". In the second phase, usually 
corresponding to about days 3-20 post-differentiation for humans, "complex embryoid 
bodies" are formed, which are characterized by extensive differentiation of ectodermal and 
mesodermal cells and derivative tissues. As used herein, the term "embryoid body" or "EB" 
encompasses both simple and complex embryoid bodies unless otherwise required by context. 
The determination of when embryoid bodies have formed in a culture of ES cells is routinely 
made by persons of skill in the art by, for example, visual inspection of the morphology. 
Floating masses of about 20 cells or more are considered to be embryoid bodies; see. e.g., 
Schmitt et al., Genes Dev. 5 (1991), 728-740; Doetschman et al. J. Embryol. Exp. Morph. 87 
(1985), 27-45. It is also understood that the term "embryoid body", "EB", or "EB cells" as 
used herein encompasses a population of cells, the majority of which being pluripotent ceils 
capable of developing into different cellular lineages when cultured under appropriate 
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conditions. As used herein, the term also refers to equivalent structures derived from 
primordial germ cells, which are primitive cells extracted from embryonic gonadal regions; 
see, e.g., Shamblott, et aL (1998), supra. Primordial germ cells, sometimes also referred to in 
the art as EG cells or embryonic germ cells, when treated with appropriate factors form 
5 pluripotent ES cells from which embryoid bodies can be derived; see, e.g., US Patent US-A- 
5,670,372; Shamblott, et al., supra. 

The terms "polynucleotide" and "nucleic acid molecule" refer to a polymer of nucleotides of 
any length. Included are genes and gene fragments, mRNA, tRNA, rRNA, ribozymes, cDNA, 

10 recombinant polynucleotides, branched polynucleotides, plasmids, vectors, isolated DNA and 
RNA, nucleic acid probes, and primers. As used in this disclosure, the term polynucleotides 
refer interchangeably to double- and single-stranded molecules. Unless otherwise specified or 
required, any embodiment of the invention that is a polynucleotide encompasses both a 
double-stranded form, and each of the two complementary single-stranded forms known or 

15 predicted to make up the double-stranded form. Included are nucleic acid analogs such as 
phosporamidates and thiophosporamidates. 

A cell is said to be "genetically altered", "transfected", or "genetically transformed" when a 
polynucleotide has been transferred into the cell by any suitable means of artificial 
20 manipulation, or where the cell is a progeny of the originally altered cell that has inherited the 
polynucleotide. The polynucleotide will often comprise a transcribable sequence encoding a 
protein of interest, which enables the cell to express the protein at an elevated level. The 
genetic alteration is said to be "inheritable" if progeny of the altered cell have the same 
alteration. 

25 

A "regulatory sequence" or "control sequence" is a nucleotide sequence involved in an 
interaction of molecules that contributes to the functional regulation of a polynucleotide, such 
as replication, duplication, transcription, splicing,, polyadenylation, translation, or 
degradation of the polynucleotide. Transcriptional control elements include promoters, 
30 enhancers, and repressors. 

Particular gene sequences referred to as promoters, like the "aMHC" or "collagen" promoter, 
are polynucleotide sequences derived from the gene referred to that promote transcription of 
an operatively linked gene expression product. It is recognized that various portions of the 
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upstream and intron untranslated gene sequence may in some instances contribute to promoter 
activity, and that all or any subset of these portions may be present in the genetically 
engineered construct referred to. The promoter may be based on the gene sequence of any 
species having the gene, unless explicitly restricted, and may incorporate any additions, 
substitutions or deletions desirable, as long as the ability to promote transcription in the target 
tissue. Genetic constructs designed for treatment of humans typically comprise a segment that 
is at least 90% identical to a promoter sequence of a human gene. A particular sequence can 
be tested for activity and specificity, for example, by operatively linking to a reporter gene; 
see Figure 1. 

According to the present invention, the term "cell- and/or development-dependent promoter" 
is intended to mean a promoter which displays its promoter activity only in particular cell 
types and/or only in particular stages of cellular development, both in cell cultures (embryoid 
bodies) and in transgenic non-human mammals derived from the ES cells according to the 
invention. In addition, any other known cell-specific promoter can be employed, e.g. for nerve 
cells, heart cells, neurons, glia cells, hematopoietic cells, endothelial cells, smooth muscle 
cells, skeletal muscle cells, cartilage cells, fibroblasts and epithelial cells. 

Genetic elements are said to be "operatively linked" if they are in a structural relationship 
permitting them to operate in a manner according to their expected function. For instance, if a 
promoter helps initiate transcription of the coding sequence, the coding sequence can be 
referred to as operatively linked to (or under control of) the promoter. There may be 
intervening sequence between the promoter and coding region so long as this functional 
relationship is maintained. 

In the context of encoding sequences, promoters, and other genetic elements, the term 
"heterologous" indicates that the element is derived from a genotypically distinct entity from 
that of the rest of the entity to which it is being compared. For example, a promoter or gene 
introduced by genetic engineering techniques into an animal of a different species is said to be 
a heterologous polynucleotide. An "endogenous" genetic element is an element that is in the 
same place in the chromosome where it occurs in nature, although other elements may be 
artificially introduced into a neighboring position. 
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The terms "polypeptide", "peptide" and "protein" are used interchangeably in this disclosure 
to refer to polymers of amino acids of any length. The polymer may comprise modified amino 
acids, it may be linear or branched, and it may be interrupted by non-amino acids. 

5 If not stated otherwise the terms "compound", "substance" and "(chemical) composition" are 
used interchangeably herein and include but are not limited to therapeutic agents (or potential 
therapeutic agents), agents of known toxicities such as neurotoxins, hepatic toxins, toxins of 
hematopoietic cells, myotoxins, carcinogens, teratogens, or toxins to one or more 
reproductive organs. The chemical compositions can further be agricultural chemicals, such as 
10 pesticides, fungicides, nematicides, and fertilizers, cosmetics, including so-called 
"cosmeceuticals", industrial wastes or by-products, or environmental contaminants. They can 
also be animal therapeutics or potential animal therapeutics. 

Industrial products that can be tested with the methods of the present invention include 
bleaches, toilet, blocks, washing-up liquids, soap powders and liquids, fabric conditioners, 

15 window, oven, floor, bathroom, kitchen and carpet cleaners, dishwater detergents and rinse 
aids, watersoftening agents, descalers, stain removers, polishes, oil products, paints, paint 
removers, glues, solvents, varnishes, air fresheners, moth balls and insecticides. 
New ingredients for household products are constantly being developed and needed to be 
tested. For example, in recent years new enzymes (to digest stains) and "optical brighteners" 

20 (which make washing appear whiter) have been developed for use in washing powders and 
liquids. New surfactants (which cut through grease to remove ingrained dirt) and chemical 
"builders" (which act as water softeners and enable surfactants to work more effectively) have 
been developed for use in washing powders and liquids, washing-up liquids and various 
cleaning agents. But also medical materials have to be tested, for example dental materials 

25 such as new filling polymers, metal alloys, and bioactive ceramic. Furthermore, chemical 
compositions of any part of a device, such as catheters, electrodes, adhesives, paste, gel or 
cream may be tested with the method of the present invention in different concentrations and 
with different ingredients and impurities present. 

30 As used herein, "molecular profile" or "profile" of a chemical composition or compound 
refers to a pattern of alterations in gene or protein expression, or both, in an ES cell, embryoid 
body, tissue, etc. contacted by the chemical composition compared to a like cell, embryoid 
body or tissue in contact only with culture medium. 
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Detailed description 

Stem cells of various kinds have become an extremely attractive modality in regenerative 
medicine. They can be proliferated in culture, and then differentiated in vitro or in situ into 
the cell types needed. This plasticity makes them ideal models for toxicity testing as well as 

5 investigation of pharmaceuticals such as tissue specific promoting factors. Particularly 
embryoid bodies which consist of different cell types of the three germ layers that interact 
with each other provide a highly sensitive test system. The ability of the stem cells to 
differentiate into the different cell types in the presence of a test substance may depend on the 
ability of the substance to promote differentiation and/or the toxicity of said substance. 

1 0 This method of monitoring cell differentiation comprises: 

- transfecting cells capable of differentiating into at least one particular cell type with at 
least one recombinant nucleic acid molecule comprising a reporter gene encoding a 
product that is secreted upon cell differentiation; 

- introducing the cells into a non-human animal or culturing the cells under conditions 
1 5 allowing differentiation of the cells; and 

. determining the amount or activity of the reporter gene product either within a body fluid 

of said transgenic non-human animal or the cell culture medium. 
The present invention is based on the use of a reporter gene that is secreted. Surprisingly it 
was found that secreted reporter molecules have certain advantages over the use of, for 
20 example, a fluorescent reporter like Green Fluorescent Protein (GFP) and its derivatives. Liu 
et al. (Biochem. Biophys. Res. Comm. 260 (1999) 712-717) observed toxic effects when 
using a GFP-reporter. Such toxic effects can be avoided or at least niinimized if a secreted 
reporter molecule is used, simply by exchanging the culturing medium on a regular basis and 
thereby preventing toxic concentration of the respective reporter molecule building up. 
25 Also, a preferred method for the differentiation of cells is to grow them as embryoid bodies 
(EBs), which show a high autofluorescence rate even when the cells were lysed. This 
background can render the use of fluorescent reporter genes for the monitoring of cell 
differentiation quite inefficient. 

Other routinely used reporter molecules like for example beta-galactosidase or luciferase 
30 require the cells to be tested in a certain assay buffer which means that the cells have to be 
destroyed to determine the activity. The system according to the invention avoids this by 
employing a secreted reporter molecule so that assaying the culture medium of the 
differentiating cells or the body fluid an animal containing said cells is sufficient to monitor 
the reporter activity and thereby the differentiation. 
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For the transfection of the nucleic acid molecule encoding the reporter protein any standard 
method known to the person skilled in the art can be employed. Any suitable expression 
vector for this purpose can be used. Suitable viral vector systems for producing stem cells 
altered according to this invention can be prepared using commercially available virus 

5 components. The introduction of the vector construct or constructs into the cells occurs in a 
known manner, e.g. by transfection or transduction with the help of viral vectors. Transfection 
of vector plasmids into cells can be accomplished by, e.g., electroporation or lipofection, i.e. 
using lipid/DNA complexes. Exemplary is the formulation Lipofectamine 2000(TM), 
available from Gibco/Life Technologies. Another exemplary reagent is FuGENE(TM) 6 

10 Transfection Reagent, a blend of lipids in non-liposomal form and other compounds in 80% 
ethanol, obtainable from Roche Diagnostics Corporation. . Viral vectors comprising effector 
genes are generally described in the publications referenced in the last section. 

The cells to be transfected have to be able to differentiate into at least one particular cell type. 

1 5 This will be the case for the precursor cell for the desired cell type. Precursor cells include but 
are not limited to for example the human neuronal precursor cell line NTERA-2 cl.Dl. 
(Leypoldt et al., J. Neurochem. 76 (2001), 806-814), the human retinal precursor cell line 
KGLDMSM (Ezeonu et al., DNA Cell BioL 19 (2000), 527-537), a human leukemic cell line, 
PER-117, bearing the markers of a T-cell precursor phenotype (Kees, Blood 72 (1988), 1524- 

20 1529) or HL60, a promyelocytic leukemia cell line that differentiates to mature neutrophils 
(Fontana, Proc. Natl. Acad. Sci. USA 77 (1980), 3664-3668). 

Since these precursor cell lines have only a limited range of developmental capacity it is 
preferred that multipotent or pluripotent cells are used. This is the case when said cells are or 
are derived from stem cells or primordial germ cells, which are preferred in the embodiments 
25 of the present invention. 

The invention can be practiced using stem cells of any vertebrate species. Included are stem 
cells from humans; as well as non-human primates, domestic animals, livestock, and other 
non-human mammals. Preferred sources for stem cells are rodents, in particular mice. 
30 Amongst the stem cells suitable for use in this invention are primate pluripotent stem cells 
derived from tissue formed after gestation, such as a blastocyst, or fetal or embryonic tissue 
taken any time during gestation. Non-limiting examples are primary cultures or established 
lines of embryonic stem cells. The invention is also applicable to adult stem cells, for example 
multipotent adult progenitor cells (MAPCs); see, e.g., Reyes and Verfaillie, Ann. N Y Acad. 
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Sci. 938 (2001), 231-235 and Weissman and Anderson, Annu. Rev. Cell. Dev. Biol. 17 
(2001), 387-403. It is also referred to the literature of Anderson et al., Nat. Med. 7 (2001), 
393-395 and Prockop, Science 276 (1997), 71-74, wherein the extraction and culture of adult 
stem cells is described. 

Media for isolating and propagating stem cells can have any of several different formulas, as 
long as the cells obtained have the desired characteristics, and can be propagated further. 
Suitable sources include Iscove's modified Dulbecco's medium (IMDM), Gibco, 
#12440-053; Dulbecco's modified Eagles medium (DMEM), Gibco #11965-092; Knockout 
Dulbecco's modified Eagles medium (KO DMEM), Gibco #10829-018; 200 mM L- 
glutamine, Gibco # 15039-027; non-essential amino acid solution, Gibco 11140-050; [beta]- 
mercaptoethanol, Sigma # M7522; human recombinant basic fibroblast growth factor (bFGF), 
Gibco # 13256-029. Exemplary senim-containing ES medium and conditions for culturing 
stem cells are known, and can be optimized appropriately according to the cell type. Media 
and culture techniques for particular cell types referred to in the previous section are provided 
in the references cited herein. 

Because of their wide range of developmental capacity said stem cells are embryonic stem 
(ES) cells in a particularly preferred embodiment of this invention. 

As mentioned before, several sources for ES cells are at the disposal of the skilled person of 
which human stem cells are preferred for most of the embodiments of the present invention. 
Human embryonic stem cells and their use for preparing different cell and tissue types are 
also described in Reprod. Biomed. Online 4 (2002), 58-63. Embryonic stem cells can be 
isolated from blastocysts of members of the primate species (Thomson et al., Proc. Natl. 
Acad. Sci. USA 92 (1995), 7844). Human Embryonic Germ (EG) cells can be prepared from 
primordial germ cells present in human fetal material taken about 8-11 weeks after the last 
menstrual period. Suitable preparation methods are described in Shamblott et al., Proc. Natl. 
Acad. Sci. USA 95 (1998), 13726. Methods for making cells that resemble embryonic stem 
cells or embryonic germ cells in morphology and pluripotency derived from primordial germ 
cells isolated from human embryonic tissue, such as from the gonadal ridges of human 
embryo, are described in US patent US-A-6,245,566. 



Recently, is has been reported that exfoliated human deciduous tooth, a comparably very 
accessible tissue, contains multipotent stem cells that were identified to be a population of 
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highly proliferative, clonogenic cells capable of differentiating into a variety of cell types 
including neural cells, adipocytes, and odontoblasts; see Miura et al., Proc. Natl. Acad. ScL 
USA, (2003), Apr 25; p S0027-8424. After in vivo transplantation, those cells were found to 
be able to induce bone formation, generate dentin, and survive in mouse brain along with 
5 expression of neural markers. Furthermore, multilineage potential of homozygous stem cells 
derived from metaphase II oocytes has been described in by Lin et al. in Stem Cells 21 
(2003), 152-161. Various sources of precursor cells in postnatal muscles and the factors that 
may enhance stem cell participation in the formation of new skeletal and cardiac muscle in 
vivo are reviewed in Grounds et al. J. Histochem. Cytochem. 50 (2002), 589-610. Purification 

10 of rare Hematopoietic Stem Cell(s) (HSC) to homogeneity that home to bone marrow is 
described in US2003/0032185. These adult bone marrow cells are described to have 
tremendous differentiative capacity as they can also differentiate into epithelial cells of the 
liver, lung, gastro-intestinal tract, and skin. This finding may contribute to clinical treatment 
of genetic disease or tissue repair. Furthermore, techniques such as nuclear transfer for 

15 embryo reconstruction may be employed wherein diploid donor nuclei are transplanted into 
enucleated Mil oocytes. This technology along with other procedures that aid in the 
establishment of customized embryonic stem (ES) cell lines that are genetically identical to 
those of the recipient have been reviewed by Colman and Kind, Trends Biotechnol. 18 
(2000), 192-196. 

20 The field of stem cell technology is being reviewed by Kiessling and Anderson, Harvard 
Medical School, in Human Embryonic Stem Cells: An Introduction to the Science and 
Therapeutic Potential; (2003) Jones and Bartlett Publishers; ISBN: 076372341X. 

In order to avoid the use of for example human embryos as the donor for stem cells, which 
25 however seems to be justifiable at least under certain circumstances, it may even be possible 
to employ transgenic non-human animals, in particular mammals as source for embryonic 
stem cells. For example, compositions and methods for making transgenic swines to be used 
as xenograft donors is described in US patent US-A-5,523,226. Likewise, WO97/12035 
describes methods of producing transgenic animals for xenotransplantation. Furthermore, 
30 immunologically compatible animal tissue, suitable for xenotransplantation into human 
patients, is described in WOO 1/88096. Methods for making embryonic germ cells from 
porcine are described for example in US patent US-A-6,545,199. Cells immunologically 
compatible with humans can also be employed for purposes of the present invention. 
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Stem cells can be propagated continuously in culture, using a combination of culture 
conditions that promote proliferation without promoting differentiation. Traditionally, stem 
cells are cultured on a layer of feeder cells, typically fibroblast type cells, often derived from 
embryonic or fetal tissue. The cell lines are plated to near confluence, usually irradiated to 
5 prevent proliferation, and then used to support when cultured in medium conditioned by 
certain cells (e.g. Koopman and Cotton, Exp. Cell 154 (1984), 233-242; Smith and Hooper, 
Devel. Biol. 121 (1987), 1-91), or by the exogenous addition of leukemia inhibitory factor 
(LIF). Such cells can be grown relatively indefinitely using the appropriate culture conditions 
without differentiation. 

10 

In the absence of feeder cells, exogenous leukemia inhibitory factor (LIF), or conditioned 
medium, ES or EG cells spontaneously differentiate into a wide variety of cell types, 
including cells found in each of the endoderm, mesoderm, and ectoderm germ layers. With 
the appropriate combinations of growth and differentiation factors, however, cell 

15 differentiation can be controlled. For example, mouse ES and EG cells can generate cells of 
the hematopoietic lineage in vitro (Keller et al., Mol. Cell. Biol. 13 (1993), 473-486; Palacios 
et al., Proc. Natl. Acad. Sci. USA 92 (1995), 7530-7534; Rich, Blood 86 (1995), 463-472). 
Additionally, mouse ES cells have been used to generate in vitro cultures of neurons (Bain et 
al., Developmental Biology 168 (1995), 342-357; Fraichard et al., J. Cell Science 108 (1995), 

20 3161-3188), cardiomyocytes (heart muscle cells) (Klug et al, Am. J. Physiol. 269 (1995), 
H1913-H1921), skeletal muscle cells (Rohwedel et al., Dev. Biol. 164 (1994), 87-101), 
vascular cells (Wang et al., Development 114 (1992), 303-316), US patent US-A-5 ,773,255 
relates to glucose-responsive insulin secreting pancreatic beta cell lines, US patent US-A- 
5,789,246 relates to hepatocyte precursor cells. Hepatic differentiation of murine embryonic 

25 stem cells is also described in Jones et al., Exp. Cell Res. 272 (2002), 15-22. 

Other progenitors of interest include but are not limited to chondrocytes, osteoblasts, retinal 
pigment epithelial cells, fibroblasts, skin cells such as keratinocytes, dendritic cells, hair 
follicle cells, renal duct epithelial cells, smooth and skeletal muscle cells, testicular 
progenitors, and vascular endothelial cells. 

30 

In certain embodiments of the invention, differentiation is promoted by withdrawing one or 
more medium component(s) that promote(s) growth of undifferentiated cells, or act(s) as an 
inhibitor of differentiation. Examples of such components include certain growth factors, 
mitogens, leukocyte inhibitory factor (LIF), and basic fibroblast growth factor (bFGF). 
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Differentiation may also be promoted by adding a medium component that promotes 
differentiation towards the desired cell lineage, or inhibits the growth of ceils with undesired 
characteristics. 

5 In a preferred embodiment of the present invention, said reporter gene in the recombinant 
nucleic acid molecule described above is operably linked to at least one cell type specific 
regulatory sequence. This embodiment has the advantage of providing not only a monitoring 
system for differentiation per se but also allows observing the differentiation into a certain 
ceil type. The regulatory sequence will drive the expression of the reporter molecule only in 

10 those ceils that acquire the desired cell type by differentiation. Usually, said regulatory 
sequence comprises such promoter and/or enhancer elements. In addition, or alternatively, a 
second reporter gene can be used which is under the control of said cell type specific 
regulatory sequence. The second reporter gene may be the same as used for the differentiation 
dependent expression or different, including conventional reporter genes such as green 

15 fluorescent protein (GFP). Furthermore, the second reporter gene may be present in the 
mentioned recombinant nucleic acid molecule or on a different vector construct, the latter 
option requiring cotransfection of both molecules into the cells. 

In a further embodiment, the differentiated cells containing the recombinant nucleic acid 
20 molecule of the present invention, and optionally further foreign DNA such as vector 
constructs described above, are enriched and separated from non-differentiated cells and/or 
from cells of non-desired cell types. Thus, in accordance with this invention, populations of 
differentiated cells are depleted of relatively undifferentiated cells and/or of cells of undesired 
cell types by using a selection system that is lethal to the undesired cells and cell types. This 
25 can be achieved, for example, by expressing a selectable marker gene under the control of a 
regulatory sequence that causes the gene to be preferentially expressed in the desired cell type 
and/or at a certain stage of development, which renders cells of a specific cell type resistant to 
a lethal effect of an external agent. To accomplish this, the cells are genetically altered with a 
recombinant nucleic acid molecule comprising the selectable marker gene before the process 
30 used to differentiate the cells into the desired lineage. The selection system may be present in 
the recombinant nucleic acid molecule of the present invention or on a different vector 
construct which is used for cotransfection. 
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The cell types, which the differentiation can be directed to, include connecting fibroblasts, 
stromal cells, endothelial cells, glia cells, neural cells, neuronal cells, hematopoietic cells, 
smooth muscle cells, skeletal muscle cells, epithelial cells, and cardiac cells. To monitor 
differentiation into these particular cell types, regulatory sequences have to be operatively 
5 linked to the reporter gene sequence that confer an expression of the reporter which is specific 
for said cell types. 

In a preferred embodiment of the method according to the invention said promoter or 
enhancer is selected from the group consisting of aMHC, MLC2V, VE-cadherin, Tie-2, Flk-1, 

10 Flt-1, GFAP, alpha-1- tubulin and collagen 2 promoter or enhancer. Whereas the pro-alphal 
(II) collagen chain (collagen 2) promoter/enhancer (Zhou et al., J. Cell Sci. 108 (1995), 3677- 
3684) is specific for chondrocytes, the expression in endothelial cells can be driven by the 
VE-cad promoter (Gory et al., Blood 93 (1999), 184-192), the Flt-1 promoter (Quinn et al., 
Biochem. Biophys. Res. Commun. 276 (2000), 1089-1099), the Tie-2 (Schlaeger et al., Proc. 

15 Natl. Acad. Sci. USA 94 (1997), 3058-3063) and the Flk-1 promoter/enhancer (Kappel et al. 
Blood 93 (1999), 4284-4292), respectively. The alpha-1 -tubulin promoter (Gloster et al., J. 
Neurosci. 14 (1994), 7319-7330) and the glial fibrillary acidic protein (GFAP) promoter 
(Besnard et al., J. Biol. Chem. 266 (1991), 18877-18883) are specific for neural cells. The 
smooth muscle heavy chain (SMHC) minimal promoter is specific for smooth muscles 

20 (Kallmeier et al., J. Biol. Chem. 270(1995), 30949-30957). An HSV-1 vector containing the 
rat tyrosine hydroxylase promoter for use in both long-term and cell type-specific expression 
in the midbrain is described by Song et al., in J. Neurochem. 68 (1997), 792-803. Cardiac 
specific promoters such as alpha-myosin heavy chain promoter (aMHC) and the myosin light 
chain-2v (MLC2v) promoter, the latter being specific for ventricular heart muscles are 

25 described further below. 

Further examples for tissue specific promoters are those, which are active in hematopoietic 
cells, cartilage cells or epidermal cells as well as insulin secreting p-cells. 
"Tissue specific" is to be subsumed under the term "cell specific". Regulatory sequences of 
the above described cell types and other cell type specific promoters can be obtained from the 
30 literature; see, e.g., "medline" and NCBI. 

Cardiovascular diseases are still among the most frequent causes of death in the Western 
industrial countries. Only through intensive basic research in this field, the pathophysiological 
causes can be found, and new therapeutic approaches can be devised, and toxicological 
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changes described. For studying the pathogenesis of cardiovascular diseases and for testing 
new pharmacological and toxicological substances, models are needed which, on one hand, 
can be transferred to humans and, on the other hand, can replace the tedious and cost- 
intensive animal models. Still in the year 1991, over 2 million animals have been employed 
5 for animal experiments in the federal states of the former Western Germany alone. 

One starting point of pharmacological and toxicological research which has recently grown 
more and more important is heart differentiation. From the stereotypically proceeding heart 
cell development, conclusions can be drawn to pathological and toxicological changes of 
cardiomyocytes. Thus, for example, it is known that the receptor state and the intracellular 

10 signal cascades are disturbed in cardiac hypertrophy (Yamazaki et al., J. Mol. Cell Cardiol. 27 
(1995), 133-140) and heart insufficiency (Johnson et al., Biochem. Pharmacol. 45 (1993), 
2365-2372). These pathologically altered cardiomyocytes are in part again similar to heart 
cells of early differentiation stages. Thus, besides toxicological tests assays for the 
identification and isolation of substances that are capable of promoting cardiac remodeling are 

15 needed, which can be used for therapeutic interventions in the treatment of patients with 
cardiac dysfunction characterized by, for example, pathological hypertrophy and fibrosis. 

However, the examinations necessary for elucidating the properties of heart cells in early 
differentiation stages are technically difficult to perform with live animals and, if they can be 

20 performed at all, it is only in very complicated studies: On day 12-13, at the earliest, it is 
possible to prepare cardiomyocytes from a murine embryo, but such cells no longer 
correspond to an early cardiac differentiation stage. A detailed analysis of receptor expression 
during various differentiation stages requires a very high expense of animal material and is 
technically difficult to perform as set forth above. Similarly, the observation of the 

25 development of a relatively undifferentiated heart cell over several days or weeks is not 
possible with an animal model. In order to test the use of novel therapeutical agents, e.g., 
inotropic substances or antiarrhythmics or toxical substances, e.g., heavy metals or retinoids, 
an invasive monitoring of animals, e.g., swine, must be performed for weeks in animal 
experiments. 

30 

Now, it has been the object of the present invention to provide a cell based assay which 
enables a simple characterization of the effects compounds on cardiomyogenesis and allows 
for functional examinations rather than being based on reporter genes such as Lac-Z (Niwa et 
al., Gene 108 (1991), 193-199; Wobus et al., J. Mol. Cell Cardiol. 29 (1997), 1525-1539; 
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Metzger et ai., Circ. Res. 78 (1996), 547-552), as with prior methods, the expression of which 
genes can be detected only after cell fixation and by means of a specific substrate. 

Surprisingly, it has been found that ES cells can be stably transfected with a DNA construct in 
which a gene encoding a secreted reporter protein is coupled with a cardiomyocyte specific 
cell- and development-dependent promoter; see the examples and the vector sequence 
depicted in SEQ ID NO: 3. This construct is integrated in the native DNA and after the 
specific activation of intracellular signals, the promoter is activated and the reporter protein 
expressed and secreted into the medium. Thus, ES cells which activate a cell-specific 
10 transcription factor at a certain point of differentiation could be recognized by determining the 
level of reporter protein or its activity in the cell culture medium. 

Thus, in a particular preferred embodiment of the invention, said differentiating cell type are 
cardiomyocytes. For this embodiment, said cell type specific regulatory sequence is 
1 5 preferably atrial and/or ventricular specific. Corresponding regulatory sequences, i.e. cardiac 
specific promoters are described in the prior art; see also supra. For example Nkx-2.5 specific 
for very early cardiomyocytes and mesodermal precursor cells, respectively, (Lints et al., 
Development 119 (1993), 419 431); human-cardiac-a-actin specific for heart tissue, 
(Sartorelli et al., Genes Dev. 4 (1990), 181 1-1822), and MLC-2V specific for ventricular heart 
20 muscle cells (O'Brien et al., Proc. Natl. Acad. Sci. USA. 90 (1993), 5157 5161; Lee et al., 
Mol. Cell Biol. 14 (1994), 1220-1229; Franz et al., Circ Res. 73 (1993), 629-638 and WO-A- 
96/16163). The cardiac specific alpha-myosin heavy chain promoter is described in Palermo 
et al., Cell. Mol. Biol. Res. 41 (1995), 501-519 and Gulick et al., J. Biol. Chem. 266 (1991), 
9180-91855. The expression of the atrial specific myosin heavy chain AMHC1 and the 
25 establishment of anteroposterior polarity in the developing chicken heart is described in 
Yutzey et al., Development 120 (1994), 871-883. 

Since the detection is aided by using a reporter molecule for which a robust, sensitive and 
specific assay exists, a particularly preferred embodiment is the method according to the 
30 invention, wherein said reporter gene product is secreted alkaline phosphatase (SEAP), alpha- 
amylase or invertase. 

SEAP is heat-resistant and not inhibited by L-homoarginine. Further advantages of this 
reporter molecule are its low toxicity an the fact that the protein levels that are secreted 
correspond directly to the amount of intracellular mRNA. Thus the SEAP reporter assay 
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provides a direct measurement for the cell type-specific expression and thereby the 
differentiation. It is also extremely sensitive: it is possible to detect 10" l3 g SEAP protein 
(Yang et al. Biotechniques 23 (1997), 1110-1114). This makes it possible to detect 
differentiation into cell types that are rare. 

The use of alpha-amylase as reporter molecule is described in for example WO98/49320. This 
reporter molecule is particularly attractive because the enzyme is extremely stable over a wide 
range of conditions. The measurement of alpha-amylase activity is simple, quantitative, 
sensitive, safe and inexpensive. It was also shown to be compatible with a SEAP reporter 
system (Schlatter et al., Gene 282 (2002), 19-31). The use of invertase as a marker protein is 
described for example in Mace et al., Virology 188 (1992), 869-874. 

Reporter gene constructs comprising inter alia secreted alkaline phosphatase as the reporter 
molecule and, which therefore can be used as a source for preparing vector constructs 
according to the present invention are described for example in WO00/34435 and Wang et al., 
Gene 279 (2001), 99-108, the latter describing murine SEAP. Detection of SEAP level can be 
performed as described in the Examples and in the prior art; see, e.g., Yang et al. (1997) and 
Wang et al. (2001), supra. The use of human secreted alkaline phosphatase in an in vivo 
mouse reporter gene model to monitor ovarian tumor growth and response to therapeutics has 
been described in Nilsson et al., Cancer Chemother. Pharmacol. 49 (2002), 93-100. This 
model can be adapted to the embodiments of the present invention. Engineered secreted 
alkaline phosphatase reporter genes being capable of being expressed in a mammal for 
extended periods of time and detection of the alkaline phosphatase activity by measuring the 
protein levels or AP activity of a serum or tissue sample from a mammal containing the cell 
with the transgene, or measuring the media from a cultured cell containing the transgene are 
also described in WO02/095068. 

Reporter activity or protein amount, in particular SEAP activity, can be detected in any 
biological fluid such as breast milk, amniotic fluid, seminal plasma, serum and urine. 

Since not all molecules that can be used as a reporter are secreted naturally, the nucleic acid 
molecule introduced into the cells can be genetically engineered to encode a protein that is 
excreted. Particularly preferred is the method described above, wherein said reporter gene 
product comprises a secretory leader sequence. Examples for secretory leader sequences are 
but are not limited to the adenovirus tripartite leader sequence (Chu et al., Biotechniques 5 
(1995), 890-896), the immunoglobulin secretion signal (Schlatter et al., Gene 282 (2002), 19- 
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3 1), the SUC2 signal sequence as well as the human interleukin 2 receptor alpha chain signal 
sequence (Kamiya et al., Tohoku J. Exp. Med. 180 (1996), 297-308). 

In order to select those cells which contain the recombinant nucleic acid molecule according 
to the invention, and optionally other vector constructs such as those described hereinbefore, 
said recombinant nucleic acid molecule and vector constructs, respectively, preferably further 
comprise a selectable marker expressed by multi- or pluripotent cells. Selectable markers are 
known to persons skilled in the art and include for example nucleoside and aminoglycoside- 
antibiotic-resistance genes conferring resistance to, e.g. puromycin, neomycin, or 
10 hygromycin. Further examples for resistance genes are dehydrofolate-reductase, which 
confers a resistance against aminopterine and methotrexate, as well as multi drug resistance 
genes, which confer a resistance against a number of antibiotics, e.g. against vinblastin, 
doxorubicin and actinomycin D; see, e.g., Blau et al., Blood 89 (1997), 146-154, which 
describes the use of the multiple drug resistance gene 1 (MDR1) gene as a dominant 
1 5 selectable marker. 

In a particularly preferred embodiment of the present invention, said selectable marker 
confers resistance to puromycin. Puromycin is particularly suited for the fast elimination of 
non-cardiac cells in adherent culture of transgenic EBs. Furthermore, drug selection of cardiac 

20 cells can be implemented entirely in the suspension culture of transgenic EBs. Hence, it could 
also be shown that purified ES derived cardiomyocytes survive much longer in culture than 
untreated counterparts. Moreover, the elimination of undifferentiated ES cells during drug 
selection process has itself been shown to have clear positive effect on viability and longevity 
of such differentiated ES derived cells as cardiomyocytes. In addition, it could be surprisingly 

25 shown that the release from surrounding non-differentiated cells induces proliferation of 
cardiomyocytes. Thus, the drug selection possesses both purifying and multiplying effect; see, 
e.g., WO02/05 1 987. Thus, the selection does not only provide the means for eliminating cells 
which can not be monitored since they do not carry the reporter gene, is also allows the 
elimination of cells that did not differentiate into the desired cell type provided that the 
30 reporter gene and the selectable marker gene are driven by the same, cell type-specific 
regulatory sequence. 

The method according to the invention, wherein said cells form cell aggregates or tissue-like 
aggregates derived from different ES cell types is a preferred embodiment and particularly 
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preferred are said cells forming embryoid bodies (EBs). As mentioned before, embryoid 
bodies represent a complex group of cells differentiating into different tissues. In one 
embodiment, the cells within an embryoid body are substantially synchronized for their 
differentiation. Accordingly, at known intervals, the majority of the synchronized cells 
differentiate into the three embryonic germ layers and further differentiate into multiple tissue 
types, such as cartilage, bone, smooth and striated muscle, and neural tissue, including 
embryonic ganglia; see also Snodgrass et aL, "Embryonic Stem Cells: Research and Clinical 
Potentials" in Smith and Sacher, eds. Peripheral Blood Stem Cells American Association of 
Blood Banks, Bethesda MD (1993). Thus, the cells within embryoid bodies provide a much 
closer model to the complexity of whole organisms than do traditional single cell or yeast 
assays, while still avoiding the cost and difficulties associated with the use of mice and larger 
mammals. Moreover, the recent availability of human embryoid bodies improves the 
predictive abilities of the invention by providing an even closer vehicle for modeling toxicity 
and identification of drugs useful for the treatment of heart disorders in human organ systems, 
and in humans. 

The embryoid body of the invention comprises a cell population, the majority of which being 
pluripotent cells capable of developing into different cellular lineages when cultured under 
appropriate conditions. It is preferred that the embryoid body comprises at least 51% 
pluripotent cells derived from totipotent ES cells. More preferably, the embryoid body 
comprises at least 75% pluripotent cells derived from totipotent ES cells. And still more 
preferably, the embryoid body comprises at least 95% pluripotent cells derived from 
totipotent ES cells. 

In its simplest form, the method of creating a molecular profile according to the present 
invention involves contacting embryoid bodies with a chemical composition of interest, and 
then determining the alterations in reporter activity or amount of reporter protein, or both, in 
the embryoid body exposed to the chemical composition (the "test embryoid body") compared 
to a embryoid body which was not exposed to the agent (a "control embryoid body"). 
Thus, the method of the present invention can be performed such that it allows self-assembly 
of the cells into the aggregates or tissue-like structures. The term "embryoid bodies" (EBs) is 
a term of art synonymous with "aggregate bodies". The terms refer to aggregates of 
differentiated and undifferentiated cells that appear when ES cells overgrow in monolayer 
cultures, or are maintained in suspension cultures. Embryoid bodies are a mixture of different 
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cell types, typically from several germ layers, distinguishable by morphological criteria. 
WO02/051987 describes a protocol to obtain embryoid bodies. The manufacturing takes place 
preferably with the "hanging drop" method or by methylcellulose culture (Wobus et al., 
Differentiation 48 (1991), 172-182). 
5 Alternatively to this, spinner flasks (stirring cultures) can be used as culture method. 
Therefore, the undifferentiated ES-cells are introduced into stirring cultures and are mixed 
permanently according to an established procedure. Therefore, 10 million ES-cells are 
introduced into 150 ml medium with 20 % FCS and are stirred constantly with the rate of 20 
rpm., wherein the direction of the stirring motion is changed regularly. 24 hours after 
1 0 introduction of the ES-cells an extra 1 00 ml medium with serum is added and thereupon 100 - 
150 ml of the medium is exchanged every day (Wartenberg et al., FASEB J. 15 (2001), 995- 
1005). Under these culture conditions large amounts of ES-cell-derived cells, i.e. 
cardiomyocytes, endothelial cells, neurons etc. depending on the composition of the medium 
can be obtained. The cells are selected by means of the resistance gene either still within the 
1 5 stirring culture or after plating, respectively. 

In a particular preferred embodiment of the present invention, embryoid bodies are prepared 
according a recent developed "mass culture" system employed in the appended examples and 
described in detail in applicant's co-pending application "Novel method for the preparation of 
Embryoid Bodies (EBs) and uses thereof, Attorneys docket no: AX02A15/P-EP, filed on the 
20 same day as the present application, i.e. on July 8, 2003. 

The embryoid bodies used to test the chemical composition can be of any vertebrate species. 
The choice of the particular species from which the embryoid body is derived will typically 
reflect a balance of several factors. First, depending on the purpose of the study, one or more 

25 species may be of particular interest. For example, human embryoid bodies will be of 
particular interest for use with compositions being tested as potential human therapeutics but 
also for toxicological tests for substances including industrial chemicals, while equine, feline, 
bovine, porcine, caprine, canine, or sheep embryoid bodies may be of more interest for a 
potential veterinary therapeutic. Embryoid bodies of other species commonly used in 

30 preclinical testing, such as guinea pigs, mice, rat, rabbits, pigs, and dogs, are also preferred. 
Typically, embryoid bodies of these species will be used for "first pass" screening, or where 
detailed information on toxicity in humans is not needed, or where a result in a murine or 
other one of these laboratory species has been correlated to a known toxicity or other effect in 
humans. Furthermore, with respect to human therapeutics, regulatory agencies generally 
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require animal data before human trials can begin; it will generally be desirable to use 
embryoid bodies of species which will be used in the preclinical animal studies. The results of 
toxicity testing in the embryoid bodies can then guide the researcher on the degree and type of 
toxicity to anticipate during the animal trials. Certain animal species are known in the art to 
5 be better models of human toxicity of different types than are others, and species also differ in 
their ability to metabolize drugs; see, e.g., Williams, Environ. Health Perspect 22 (1978), 
133-138; Duncan, Adv. Sci. 23 (1967), 537-541. Thus, the particular species preferred for use 
in a particular preclinical toxicity study may vary according to the intended use of the drug 
candidate. For example, a species which provide a suitable model for a drug intended to affect 
10 the reproductive system may not be as suitable a model for a drug intended to affect the 
nervous system. Criteria for selecting appropriate species for preclinical testing are well 
known in the art. 

Once an embryoid body culture has been initiated, it can be contacted with a chemical 

15 composition. Conveniently, the chemical composition is in an aqueous solution, preferably in 
a solvent conventionally used in cell culture, for example DMSO, and is introduced to the 
culture medium; see also the examples. The introduction can be by any convenient means, but 
will usually be by means of a pipette, a micropipettor, or a syringe. In some applications, such 
as high throughput screening, the chemical compositions will be introduced by automated 

20 means, such as automated pipetting systems, which may be on robotic arms. Chemical 
compositions can also be introduced into the medium as in powder or solid forms, with or 
without pharmaceutical excipients, binders, and other materials commonly used in 
pharmaceutical compositions, or with other carriers which might be employed in the intended 
use. For example, chemical compositions intended for use as agricultural chemicals or as 

25 petrochemical agents can be introduced into the medium by themselves to test the toxicity of 
those chemicals or agents, or introduced in combination with other materials with which they 
might be used or which might be found in the environment, to determine if the combination of 
the chemicals or agents has a synergistic effect. Typically, the cultures will be shaken at least 
briefly after introduction of a chemical composition to ensure the composition is dispersed 

3 0 throughout the medium. 



The time as which a chemical composition is added to the culture is within the discretion of 
the practitioner and will vary with the particular study objective. Conveniently, the chemical 
composition will be added as soon as the embryoid body develops from the stem cells, 
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permitting the determination of the alteration in protein or gene expression on the 
development of all the tissues of the embryoid body. It may be of interest, however, to focus 
the study on the effect of the composition on a particular tissue type. As previously noted, 
individual tissues, such as muscle, nervous, and hepatic tissue, are known to develop at 
specific times after the embryoid body has formed. Addition of the chemical composition can 
therefore be staged to occur at the time the tissue of interest commences developing, or at a 
chosen time after commencement of that development, in order to observe the effect on 
altering gene or protein expression in the tissue of interest. 

Different amounts of a chemical composition will be used to contact an embryoid body 
depending on the amount of information known about the toxicity of that composition, the 
purposes of the study, the time available, and the resources of the practitioner. A chemical 
composition can be administered at just one concentration, particularly where other studies or 
past work or field experience with the compound have indicated that a particular 
concentration is the one which is most commonly found in the body. More commonly, the 
chemical composition will be added in different concentrations to cultures of embryoid bodies 
run in parallel, so that the effects of the concentration differences on gene or protein 
expression and, hence, the differences in toxicity of the composition at different 
concentrations, can be assessed. Typically, for example, the chemical composition will be 
added at a normal or medium concentration, and bracketed by twofold or fivefold increases 
and decreases in concentration, depending on the degree of precision desired. 

Where the composition is one of unknown toxicity, a preliminary study is conveniently first 
performed to determine the concentration ranges at which the composition will be tested. A 
variety of procedures for determining concentration dosages are known in the art. One 
common procedure, for example, is to determine the dosage at which the agent is directly 
toxic. The practitioner then reduces the dose by one half and performs a dosing study, 
typically by administering the agent of interest at fivefold or twofold dilutions of 
concentration to parallel cultures of cells of the type of interest. For environmental 
contaminants, the composition will usually also be tested at the concentration at which it is 
found in the environment. For agricultural chemicals, such as pesticides which leave residues 
on foodstuffs, the agent will usually be tested at the concentration at which the residue is 
found, although it will likely be tested at other concentrations as well. Thus, the dilution of 
test compounds can be done by making in separated tubes a series of dilution of 50 or 100 
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fold concentrated compounds in DMSO. One or two yl of each dilution are distributed in each 
well before cell suspension distribution. Particularly preferred are stock and dilution series as 
described for retinoic acid in example 3, 

The above considerations with respect to contacting the compounds with the EBs, contacting 
time, etc, also apply to the assays of the invention performed on e.g. ES cells, tissue and non- 
human animals, if applicable. 

The present invention also relates to reporter gene constructs for monitoring cell 
differentiation comprising the recombinant nucleic acid molecule defined previously as well 
as to cells capable of differentiating into at least one particular cell type, that contain said 
reporter gene construct, or are obtainable by the methods described herein. Likewise, cell 
aggregates, tissues, organs, implants or transplants comprising said reporter gene construct, 
obtainable by the methods described herein or constituted from said cells, cell aggregates, 
tissues, organs, implants or transplants are subject of the invention. 

In a still further embodiment, the present invention relates to a transgenic non-human animal 
which can be generated from the mentioned ES cells and ES cell derived cell types and cell 
aggregates; see supra. A method for the production of a transgenic non-human animal, which 
is encompassed by the present invention, for example transgenic mouse, comprises 
introduction of a reporter gene as described above into a germ cell, an embryonic cell, stem 
cell or an egg or a cell derived therefrom. The non-human animal can be used in accordance 
with a screening method of the invention described herein. Production of transgenic embryos 
and screening of those can be performed, e.g., as described by A. L. Joyner Ed., Gene 
Targeting, A Practical Approach (1993), Oxford University Press. A general method for 
making transgenic non-human animals is described in the art, see for example WO 94/24274. 
For making transgenic non-human organisms (which include homologously targeted non- 
human animals), embryonal stem cells (ES cells) are preferred. Murine ES cells, such as AB- 
1 line grown on mitotically inactive SNL76/7 cell feeder layers (McMahon and Bradley, Cell 
62 (1990), 1073-1085) essentially as described (Robertson, in Teratocarcinomas and 
Embryonic Stem Cells: A Practical Approach. E. J. Robertson, ed. (Oxford: IRL Press) 
(1987), 71-112) may be used for homologous gene targeting. Other suitable ES lines include, 
but are not limited to, the E14 line (Hooper et al., Nature 326 (1987), 292-295), the D3 line 
(Doetschman et al., J, Embryol. Exp. Morph. 87 (1985), 27-45), the CCE line (Robertson et 
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al., Nature 323 (1986), 445-448), the AK-7 line (Zhuang et al., Cell 77 (1994), 875-884) and 
others; see also supra. The success of generating a mouse line from ES cells bearing a specific 
targeted mutation depends on the pluripotence of the ES cells (i.e., their ability, once injected 
into a host developing embryo, such as a blastocyst or morula, to participate in embryogenesis 

5 and contribute to the germ cells of the resulting animal). The blastocysts containing the 
injected ES cells are allowed to develop in the uteri of pseudopregnant nonhuman females and 
are bom as chimeric mice. The resultant transgenic mice are backcrossed and screened for the 
presence of the correctly targeted transgene (s) by PCR or Southern blot analysis on tail 
biopsy DNA of offspring so as to identify transgenic mice for the reporter locus/loci. For 

10 example, transient expression of a recombinase gene directly introduced into fertilized eggs 
by pronuclear injection provides a fast and efficient method for generating mutant mice with 
desired deletions or translocations in target genes. Efficient removal of loxP-flanked DNA 
sequences in a gene-targeted locus by transient expression of Cre recombinase in fertilized 
eggs is described by Sunaga et al., Mol. Reprod. Dev. 46 (1997), 109-1 13. 

15 

Methods for producing transgenic flies, such as Drosophila melanogaster are also described in 
the art, see for example US-A-4,670,388, Brand and Perrimon, Development 118 (1993), 401- 
415; and Phelps and Brand, Methods 14 (1998), 367-379. Transgenic worms such as C. 
elegans can be generated as described in Mello et al., EMBO J. 10 (1991), 3959-3970, 
20 Plasterk, Methods Cell. Biol. 48 (1 995), 59-80. 

All of the embodiments above can not only be used to monitor cell differentiation and 
optimize culturing conditions using modulators in their pure forms but the effective agent(s) 
might also be contained in a composition. Hence, compositions of matter comprising any one 
25 of those recombinant nucleic acid molecules, cells, cell aggregates, or tissues of the present 
invention as described herein are encompassed in the scope of the present invention. As 
described herein, those compositions and methods of the invention can be used for a variety 
of purposes, for example for analyzing early steps of tissue formation during embryonic 
development or the influence of factors and compounds on this process. 

30 

In a particularly preferred embodiment, the present invention relates to arrays and chips 
comprising a solid support and attached thereto or suspended thereon cells, cell aggregates or 
tissue obtained by the method of the present invention or being in the differentiation process. 
Such arrays generally consist of a substrate of glass, plastic or silicon on which the test cells 
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aggregates or tissues are deposited in a particular pattern. Depending on the type of array, 
these might additionally be covered by a conductor, e.g. gold, platinum, indium-tin-oxide, 
iridium, etc., which allows a direct measurement by employing the conductivity of cells. The 
use of such planar microelectrode arrays for cultured cells and cell aggregates as biosensors is 
5 of particular interest. 

Furthermore the invention relates to an apparatus for analyzing said array, which will be able 
to measure the particular read-out of the assay employed, e.g. SEAP reaction product, in a 
quantitative manner and assign the results to the position on the array. 

The system for monitoring cell differentiation according to the present invention can also be 
employed to identify and characterize substances that modify the differentiation process. 
Populations of differentiated cells and tissue suitable for human administration have to be 
identified after the differentiation process to avoid administering undifferentiated cells with 

15 high tumorigenic potential or cells differentiated into cells that are not needed for the 
particular therapeutic purpose. The positive identification of cells differentiated into a certain 
cell type allows also the optimization of culturing conditions that is necessary to obtain the 
relatively high numbers of differentiated cells needed for therapeutic purposes. Thus, there is 
a need to identify conditions for culturing ES cells that drive their differentiation into the 

20 desired direction. 

Accordingly, a preferred embodiment of the invention is a method of obtaining and/or 
profiling a modulator of cell differentiation comprising: 

contacting a test sample comprising a cell, cell aggregate, tissue or organ or a non-human 
animal as defined above with a test substance; and 
25 - determining the effect of the test substance on the amount of the reporter gene product or 
activity compared to a control sample or animal. 

In particular, the methods of the invention can be used for toxicological, embryotoxic, 
mutagenic, and/or teratogenic in vitro tests; see supra. Another important aspect of the present 
30 invention is therefore a method of determining toxicity, preferably teratogenicity, 
embryotoxicity, chronic or acute toxicity of a compound comprising the steps of the methods 
described herein, wherein a reduced or enhanced level or activity of said reporter gene 
product is indicative for the toxicity of the compound. 
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The assays may be simple "yes/no" assays to determine whether there is a change in 
expression of the secreted reporter protein by detecting reporter, e.g., enzyme activity or the 
amount of reporter protein. The assay may be made quantitative by comparing the amount of 
activity and/or amount of reporter in a test sample with the levels of activity and/or amount in 

5 a standard sample. As described in the examples, the test compound or a plurality of test 
compounds are subjected to the test cell, preferably embryoid body in different concentrations 
or dilution series, preferably at doses that correspond to physiological levels of the 
corresponding type of test compounds; see also example 3. It is thus also possible to easy 
generate compound profiles in purpose similar to those described in WO00/34525. For 

10 example, two or more assays may be used, wherein for each assay the test cells or, e.g., 
embryoid bodies comprise recombinant DNA molecules with different cell- and/or 
development-specific promoters operably linked to the same or different reporters, preferably 
wherein each reporter is a secreted protein, e.g., enzyme, most preferably wherein one or each 
reporter is SEAP. Those assays can be performed in parallel or subsequently; or the results of 

15 one assay may be compared with results of a corresponding assay performed elsewhere. 
Likewise, it is envisaged to provide and use cells which comprise two of the mentioned 
recombinant molecules but with different promoters and reporters as mentioned above. Thus, 
a molecular profile of a test chemical composition can be established by detecting the 
alterations in reporter gene activity or amount of reporter protein in, for example, embryoid 

20 bodies contacted by the test chemical composition as described in previous sections. Once the 
molecular profile of the test composition is determined, it can be compared to that of a 
chemical composition with predetermined toxicities or, preferably, to a library of molecular 
profiles of chemical compositions with predetermined toxicities. The outcome of such 
comparison provide information for one to predict the likelihood of whether the test 

25 composition is toxic, what type of toxicities, and how toxic it would be as compared to the 
other known toxic compositions. 

For the purpose of practicing the invention, the predictions of toxicity of the test composition 
based on its molecular profiles in ES cells, tissue, etc, preferably EB cells does not have to be 
30 100% accurate. To have a major positive impact on the efficiency and costs of drug 
development, one only has to modestly increase the probability that the less toxic and thus 
more successful drug candidates are, for example, on the top half of a prioritized list of new 
drug leads. 
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Since the cells and tissue obtained in accordance with the present invention more closely 
resemble the in vivo situation compared to conventional cell based assays, the results obtained 
by the assays of the present invention are expected to correlate to in vivo teratogenicity or 
5 embryotoxicity of the tested compounds as well. The simple read-out of the reporter molecule 
assay is easier measured and interpreted than determining the influence of a substance by 
monitoring the embryonic and post-natal development of a test animal in all aspects and in 
different stages of development. This requires a number of animals to be sacrificed at 
different ages, whereas by employing the reporter molecule secreted in an developmental- 
10 specific manner, samples of body fluids from the animal can be obtained repeatedly and 
allowing the monitoring over an extended time period up to the life-time of the animal. This 
also reduces the variability of test results. 

Several test substances can be combined and either added simultaneously or sequentially to 
15 gain information about possible enhancing or quenching effects. Thus a further aspect of the 
invention relates to the method described previously, wherein said contacting step further 
includes contacting said test sample or animal with at least one second test substance in the 
presence of said first test substance. Two or more substances tested in combination will 
provide information about their interaction in general. 

20 

A preferred embodiment of the methods according to the present invention involves adding a 
compound known to activate or inhibit differentiation process to the culture medium or 
animal, particularly if this test substance is a therapeutic agent or a mixture thereof. This 
screening may be done, for example, either because the compound is known to have a effect 
25 on the differentiation of certain cell types and is tested to determine its potential for guiding 
the differentiation of other cell types, or because a compound designed to have effects 
elsewhere may have unintended side effects. The last aspect applies particularly to therapeutic 
agents. 

30 Both aspects of drug screening can be adapted to compound library screening methods as well 
as screening industrial chemicals from the shelf. Thus another embodiment of the invention 
encompasses the method described above, wherein preferably in a first screen said test 
substance is comprised in and subjected as a collection of test substances. Such a collection of 
test substances, usually a combinatorial compound library, can have a diversity of about 10 3 
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to about 10 s and is successively reduced in running the method several times until the 
diversity has been reduced to down to a desired factor, preferably down to one compound or 
particular class of compounds. Optionally the method of the present invention can combined 
with other screening methods known in the art. Methods for the generation and use of 

5 peptidomimetic combinatorial libraries are described in the prior art, for example in Ostresh, 
Methods in Enzymology 267 (1996), 220-234 and Domer, Bioorg. Med. Chem. 4 (1996), 
709-715. Drug discovery by dynamic combinatorial libraries is described, for example, m 
Nat. Rev. Drug Discov. 1 (2002), 26-36 and Drug Discov. Today 7 (2002), 117-125. 
Compounds identified using the method of the invention can be further evaluated in single 

1 0 experiments employing other methods of the present invention. 

Compounds and candidate agents are obtained from a wide variety of sources including 
libraries of synthetic or natural compounds, or as mentioned industrial chemicals; see supra. 
For example, numerous means are available for random and directed synthesis of a wide 

15 variety of organic compounds and biomolecules, including expression of randomized 
oligonucleotides and oligopeptides. Alternatively, libraries of natural compounds in the form 
of bacterial, fungal, plant and animal extracts are available or readily produced. For example, 
inhibition of tumor-induced angiogenesis and matrix-metalloproteinase expression in 
confrontation cultures of embryoid bodies and tumor spheroids by plant ingredients used in 

20 traditional Chinese medicine has been described by Wartenberg et al. in Lab. Invest. 83 
(2003), 87-98. 

A preferred embodiment of the screening method is performed on an array as provided by the 
invention 
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The assay methods of the present invention can be in a conventional laboratory format or 
adapted for high throughput screening. The term "high throughput screening" (HTS) refers to 
an assay design that allows easy analysis of multiple samples simultaneously, and capacity for 
robotic manipulation. Another desired feature of high throughput assays is an assay design 
30 that is optimized to reduce reagent usage, or niinimize the number of manipulations in order 
to achieve the analysis desired. Examples of assay formats include 96-well, 384-well or more- 
well plates, levitating droplets, and "lab on a chip" microchannel chips used for liquid 
handling experiments. It is well known by those in the art that as miniaturization of plastic 
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molds and liquid handling devices are advanced, or as improved assay devices are designed, 
that greater numbers of samples may be performed using the design of the present invention. 
In the method of the invention, said one or more cells or tissues are preferably contained in a 
container, for example in a well of a microliter plate, which may be a 24, 96, 384 or 1586 well 
5 plate. Alternatively, the cells can be introduced into a microfluidics device, such as those 
provided by Caliper (Newton, MA, USA). In another preferred embodiment, the method of 
the present invention comprises taking 3, 4, 5, 7, 10 or more measurements, optionally at 
different positions within the container. 

10 The methods according to the invention can also being performed using cells that are 
genetically engineered to (over) express or inhibit the expression of a target gene. Examples 
for genetically altered stem cells are described in WO03/014326 which provides mammalian 
stem cells expressing the VEGF receptor VEGFR-1 or WO95/06409 disclosing human 
hematopoietic stem cells expressing recombinant DNA constructs encoding a recombinant 

15 molecule containing a signal transducing region and an antigenic specificity region. 
Alternatively to using already altered stem cells, said target gene can be transfected at the 
same time as the construct of the present invention conferring the expression of the secreted 
reporter molecule. 

20 HTS, combinatorial library screening as well as conventional assay formats can also be 
employed to identify novel compounds which have an effect on differentiation. Novel as well 
as compounds previously known to modulate cell differentiation can be used to manufacture a 
drug for this purpose. Thus a preferred embodiment of the present invention is a method of 
obtaining and manufacturing a drug which promotes or inhibits formation of specific cell 

25 types comprising the steps of the methods described above, wherein an enhanced or reduced 
level or activity of the reporter gene product is indicative for the drug. 

Compounds of interest encompass numerous chemical classes, though typically they are 
organic molecules. Candidate agents comprise functional groups necessary for structural 
interaction with proteins, particularly hydrogen bonding, and typically include at least an 
30 amine, carbonyl, hydroxyl or carboxyi group, preferably at least two of the functional 
chemical groups. The candidate agents often comprise cyclical carbon or heterocyclic 
structures and/or aromatic or polyaromatic structures substituted with one or more of the 
above functional groups. Candidate agents are also found among biomolecules including 
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peptides, saccharides, fatty acids, steroids, purines, pyrimidines, derivatives, structural 
analogs or combinations thereof. 

Additionally, natural or synthetically produced libraries and compounds are readily modified 
through conventional chemical, physical and biochemical means, and may be used to produce 
5 combinatorial libraries. Known pharmacological agents may be subjected to directed or 
random chemical modifications, such as acylation, alkylation, esterification, amidification, 
etc. to produce structural analogs. 

The compounds may also be included in a sample including fluids to which additional 
10 components have been added, for example components that affect the ionic strength, pH, total 
protein concentration, etc. In addition, the samples may be treated to achieve at least partial 
fractionation or concentration. Biological samples may be stored if care is taken to reduce 
degradation of the compound, e.g. under nitrogen, frozen, or a combination thereof. The 
volume of sample used is sufficient to allow for measurable detection, usually from about 0.1 
15 ul to 1 ml of a biological sample is sufficient. 

Differentiation processes do not only play a major role in embryonic development but are also 
important for an adult organism since the continuous replacement of senescent or cells 
damaged by injury or disease is achieved by precursor cells differentiating into the cells 
20 needed. An important aspect of the present invention is thus a method of manufacturing an 
agent which supports wound healing and/or healing of damaged tissue comprising the steps 
described herein, wherein an enhanced level or activity of the reporter gene product is 
indicative for said agent. 

25 Compounds identified, isolated and/or produced by the methods described above can also be 
used as lead compounds in drug discovery and preparation of drugs or prodrugs. This usually 
involves modifying the lead compound or a derivative thereof or an isolated compound as 
described hereinbefore such as modifying said substance to alter, eliminate and/or derivative a 
portion thereof suspected causing toxicity, increasing bioavailability, solubility and/or half- 

30 life. Accordingly, a preferred embodiment of the present invention encompasses the methods 
described previously and further modifying said substance to alter, eliminate and/or derivative 
a portion thereof suspected causing toxicity, increasing bioavailability, solubility and/or half- 
life. Methods for clinical compound discovery comprises for example ultrahigh-throughput 
screening (Sundberg, Curr. Opin. Biotechnol. 11 (2000), 47-53) for lead identification, and 
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structure-based drug design (Verlinde and Hoi, Structure 2 (1994), 577-587) and 
combinatorial chemistry (Salemme et al., Structure 15 (1997), 319-324) for lead 
optimization. The various steps recited above are generally known in the art. For example, 
computer programs for implementing these techniques are available; e.g., Rein, Computer- 
5 Assisted Modeling of Receptor-Ligand Interactions (Alan Liss, New York, 1989). Methods 
for the preparation of chemical derivatives and analogues are well known to those skilled in 
the art and are described in, for example, Beilstein, Handbook of Organic Chemistry, Springer 
edition New York Inc., 175 Fifth Avenue, New York, N.Y. 10010 U.S.A. and Organic 
Synthesis, Wiley, New York, USA. Furthermore, peptidomimetics and/or computer aided 
10 design of appropriate derivatives and analogues can be used, for example, according to the 
methods described above. Methods for the lead generation in drug discovery also include 
using proteins and detection methods such as mass spectrometry (Cheng et al. J. Am. Chem. 
Soc. 117 (1995), 8859-8860) and some nuclear magnetic resonance (NMR) methods (Fejzo et 
al., Chem. Biol. 6 (1999), 755-769; Lin et al., J. Org. Chem. 62 (1997), 8930-8931). They 
15 may also include or rely on quantitative structure-action relationship (QSAR) analyses 
(Kubinyi, J. Med. Chem. 41 (1993), 2553-2564, Kubinyi, Pharm. Unserer Zeit 23 (1994), 
281-290) combinatorial biochemistry, classical chemistry and others; see, for example, 
Holzgrabe and Bechtold, Pharm. ActaHelv. 74 (2000), 149-155. 

20 In one embodiment, the method of the invention further comprises mixing the substance 
isolated or modified with a pharmaceutically acceptable carrier. Examples of carriers and 
methods of formulation may be found in Remington's Pharmaceutical Sciences. 

Substances are metabolized after their in vivo administration in order to be eliminated either 
25 by excretion or by metabolism to one or more active or inactive metabolites (Meyer, J. 
Pharmacokinet. Biopharm. 24 (1996), 449-459). Thus, rather than using the actual compound 
or drug identified and obtained in accordance with the methods of the present invention a 
corresponding formulation as a pro-drug can be used which is converted into its active form 
in the patient by his/her metabolism. Precautionary measures that may be taken for the 
30 application of pro-drugs and drugs are described in the literature; see, for review, Ozama, J. 
Toxicol. Sci. 21 (1996), 323-329. 

The present invention also relates to a kit containing specific reagents such as those described 
hereinbefore useful for conducting any one of the above described methods of the present 
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invention, containing for example the reporter gene construct described hereinbefore, multi- 
or pluripotent cells, standard compounds, and optionally culture medium, recombinant nucleic 
such acid molecules, detection means for the reporter molecule, in particular for SEAP such 
as EDTA, L-Homoarginine, NBT/BCIP, fixative solution, incubation buffer, PBS, etc. Such a 
5 kit would typically comprise a compartmentalized carrier suitable to hold in close 
confinement at least one container. The carrier would further comprise reagents useful for 
performing said methods. The carrier may also contain a means for detection such as labeled 
enzyme substrates or the like. 

1 0 Yet another aspect of the present invention relates to a method of conducting a drug discovery 
business, comprising: 

- providing one or more assay systems as described herein for identifying a modulator of 
cell differentiation; and/or 

- conducting therapeutic profiling of modulators identified in the previous step, or further 
1 5 analogs thereof, for efficacy and toxicity in animals according to the invention; and 

formulating a pharmaceutical preparation including one or more modulators identified in 
the previous step as having an acceptable therapeutic profile. 

Utilizing the methods described above, the identity of a modulator of differentiation or a 
20 derivative thereof are determined. Agents are identified by their ability to alter the level of 
secreted reporter molecule. For suitable lead compounds that are identified, further 
therapeutic profiling of the agent, or analogs thereof, can be carried out for assessing efficacy 
and toxicity in animals. Those compounds having therapeutic profiles after animal testing can 
be formulated into pharmaceutical preparations for use in humans or for veterinary uses. The 
25 subject business method can include an additional step of establishing a distribution system 
for distributing the pharmaceutical preparation for sale, and may optionally include 
establishing a sales group for marketing the pharmaceutical preparation. 

Instead of developing the identified cell differentiation modulating agents in house, further 
30 drug development can also be achieved by a different company. Thus a further aspect of the 
present invention relates to a method of conducting a target discovery business comprising: 

- providing one or more assay systems described herein for identifying modulators of cell 
differentiation; 
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- (optionally) conducting therapeutic profiling of modulators identified in the previous step 
for efficacy and toxicity in animals according to the invention; and 

- licensing, to a third party, the rights for further drug development and/or sales for 
modulators identified in the first step, or analogs thereof. 

5 For suitable lead compounds that have been identified, further profiling of the agent, or 
further analogs thereof, can be carried out for assessing efficacy and toxicity in animals, 
depending on the modalities of the agreement with the respective third party. Further 
development of those compounds for use in humans or for veterinary uses will then be 
conducted by the third party. The subject business method will usually involve either the sale 

10 or licensing of the rights to develop said compound but may also be conducted as a service, 
offered to drug developing companies for a fee. 

The present invention also relates to modulators of cell differentiation identified according to 
the methods described above as well as to pharmaceutical compositions for use in the 
1 5 modulation of cell differentiation comprising such a modulator. 

The modulator according to the invention can be combined with suitable diluents or carriers, 
preferably those which are pharmaceutical^ acceptable. Examples of such carriers, diluents 
and methods of formulation may be found in Remington's Pharmaceutical Sciences. To form 
20 a pharmaceutical^ acceptable composition suitable for effective administration, such 
compositions will contain an effective amount of the modulator. Carriers or diluents are 
usually sterile and non-toxic, and defined as vehicles commonly used to formulate 
pharmaceutical compositions for animal or human administration. The diluent is selected so 
as not to affect the biological activity of the combination. Examples of such diluents are 
25 distilled water, physiological saline, Ringer's solutions, dextrose solution, and Hank's 
solution. In addition, the pharmaceutical composition or formulation may also include other 
carriers, adjuvants, or nontoxic, nontherapeutic, nonimmunogenic stabilizers and the like. A 
therapeutically effective dose refers to that amount of modulator which is sufficient to achieve 
the desired effect on differentiation of target cells. 
30 Further examples of suitable pharmaceutical carriers are well known in the art and include 
phosphate buffered saline solutions, emulsions, such as oil/water emulsions, various types of 
wetting agents, sterile solutions etc. Compositions comprising such carriers can be formulated 
by well known conventional methods. Accordingly, the present invention also provides a 
method of making a pharmaceutical composition for use in modulating cell differentiation 
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comprising mixing a modulator of cell differentiation identified according to a method of the 
invention with a suitable diluent or carrier. 

The use of a reporter gene construct, a cell, a cell aggregate, tissue, organ, implant or 
5 transplant, a non-human animal, a composition, an array or the apparatus defined herein in 
drug discovery or pharmacokinetic or pharmacological profiling is also preferred embodiment 
of the present invention. Hence, the means and methods of the present invention described 
herein-before can be used in a variety of applications including but not limited to "loss of 
function" assays with ES cells containing homozygous mutations of specific genes, "gain of 
10 function" assays with ES cells overexposing exogenous genes, developmental analysis of 
teratogenic/embryotoxic compounds in vitro, pharmacological assays and the establishment of 
model systems for pathological cell functions, and application of differentiation and growth 
factors for induction of selectively differentiated cells which, can be used as a source for 
tissue grafts; see for review, e.g., Guan et aL, Altex 16 (1999), 135-141. 

15 

As described in the examples, for the expression of the secreted human placental alkaline 
phosphatase protein (SEAP), a particular promoter has been used, i.e. the promoter region of 
the mouse alpha myosin heavy chain gene; see NCBI GenBank Accession No. #U71441 
(SEQ ID NO: 1). In accordance with the present invention, it could be shown that this 
20 promoter region is particularly suited for cell type specific expression of a reporter gene, 
especially for the purposes of the present invention. In addition, further experiments 
performed in accordance with the present invention surprisingly revealed that the mouse 
ventricular myosin regulatory light chain gene promoter (see NCBI GenBank Accession No. 
#AF302688; SEQ ID NO: 2) is particularly suited for the expression of the reporter gene as 
25 well. Thus, the present invention also relates to a recombinant DNA molecule as described 
hereinbefore in a cell-type specific reporter system, comprising the promoter of the mouse 
alpha myosin heavy chain gene and/or of the mouse ventricular myosin regulatory light chain 
gene, in particular the sequence depicted in NCBI GenBank Accession No. #U71441 (SEQ 
ID NO: 1) and #AF302688 (SEQ ID NO: 2), respectively, or a fragment thereof. Typically, 
30 the recombinant DNA molecule is a vector and comprises other functional DNA sequences 
such as ori, bacterial selectable marker, polyadenylation signal, multiple cloning sites, and the 
like. Hence, such vector may also be used to express specifically for examples growth factors 
and the like or target cDNAs in order to elucidate their function during cell development. 
Thus, the present invention involves the use of the mentioned promoter regions for the 
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expression of a DNA sequence encoding a reporter protein or selectable marker; see supra. In 
a particular, the present invention relates to the expression vector described in Example 1 
comprising the nucleotide sequence depicted in SEQ ID NO. 3 as well as to a corresponding 
vector, wherein the mouse alpha myosin heavy chain gene promoter has been replaced by the 
5 mouse ventricular myosin regulatory light chain gene promoter, and their use in the afore- 
described embodiments. 

These and other embodiments are disclosed and encompassed by the description and 
examples of the present invention. Further literature concerning any one of the materials, 
10 methods, uses and compounds to be employed in accordance with the present invention may 
be retrieved from public libraries and databases, using for example electronic devices. For 
example the public database "Medline" may be utilized, which is hosted by the National 
Center for Biotechnology Information and/or the National Library of Medicine at the National 
Institutes of Health. Further databases and web addresses, such as those of the European 
15 Bioinformatics Institute (EBI), which is part of the European Molecular Biology Laboratory 
(EMBL) are known to the person skilled in the art and can also be obtained using internet 
search engines. An overview of patent information in biotechnology and a survey of relevant 
sources of patent information useful for retrospective searching and for current awareness is 
given in Berks, TIBTECH 12 (1994), 352-364. 

20 

The above disclosure generally describes the present invention. A more complete 
understanding can be obtained by reference to the following specific examples and figure 
which are provided herein for purposes of illustration only and are not intended to limit the 
scope of the invention. The contents of all cited references (including literature references, 
25 issued patents, published patent applications as cited throughout this application and 
manufacturer's specifications, instructions, etc) are hereby expressly incorporated by 
reference; however, there is no admission that any document cited is indeed prior art as to the 
present invention. 

30 The practice of the present invention will employ, unless otherwise indicated, conventional 
techniques of cell biology, cell culture, molecular biology, transgenic biology, microbiology, 
recombinant DNA, and immunology, which are within the skill of the art. 
For further elaboration of general techniques concerning stem cell technology, the practitioner 
can refer to standard textbooks and reviews, for example Teratocarcinomas and embryonic 
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stem cells: A practical approach (E. J. Robertson, ed., ERL Press Ltd. 1987); Guide to 
Techniques in Mouse Development (P. M. Wasserman et al., eds., Academic Press 1993); 
Embryonic Stem Cell Differentiation in Vitro (Wiles, Meth. Enzymol. 225 (1993), 900,); 
Properties and uses of Embryonic Stem Cells: Prospects for Application to Human Biology 
5 and Gene Therapy (Rathjen et aL, Reprod. FertiL Dev. 10 (1998), 31,). Differentiation of stem 
cells is reviewed in Robertson, Meth. Cell Biol. 75 (1997), 173; and Pedersen, Reprod. FertiL 
Dev. 10 (1998), 31. Besides the sources for stem cells described already above further 
references are provided; see Evans and Kaufman, Nature 292 (1981), 154-156; Handyside et 
al., Roux's Arch. Dev. Biol., 196 (1987), 185-190; Flechon et al., J. Reprod. FertiL Abstract 
10 Series 6 (1990), 25; Doetschman et al., Dev. BioL 127 (1988), 224-227; Evans et al., 
Theriogenology 33 (1990), 125-128; Notarianni et al., J. Reprod. FertiL Suppl., 43 (1991), 
255-260; Giles et al., BioL Reprod. 44 (Suppl. 1) (1991), 57; Strelchenko et aL, 
Theriogenology 35 (1991), 274; Sukoyan et al., Mol. Reprod. Dev. 93 (1992), 418-431; 
Iannaccone et aL, Dev. BioL 163 (1994), 288-292. 

15 

Methods in molecular genetics and genetic engineering are described generally in the current 
editions of Molecular Cloning: A Laboratory Manual, (Sambrook et al., (1989) Molecular 
Cloning: A Laboratory Manual, 2nd ed., Cold Spring Harbor Laboratory Press); DNA 
Cloning, Volumes I and II (D. N. Glover ed., 1985); Oligonucleotide Synthesis (M. J. Gait 

20 ed., 1984); Nucleic Acid Hybridization (B. D. Hames & S. J. Higgins eds. 1984); 
Transcription And Translation (B. D. Hames & S. J. Higgins eds. 1984); Culture Of Animal 
Cells (R. I. Freshney, Alan R. Liss, Inc., 1987); Gene Transfer Vectors for Mammalian Cells 
(Miller & Calos, eds.); Current Protocols in Molecular Biology and Short Protocols in 
Molecular Biology, 3rd Edition (F. M. Ausubel et al., eds.); and Recombinant DNA 

25 Methodology (R. Wu ed., Academic Press). Gene Transfer Vectors For Mammalian Cells (J. 
H. Miller and M. P. Calos eds., 1987, Cold Spring Harbor Laboratory); Methods In 
Enzymology, Vols. 154 and 155 (Wu et al. eds.); Immobilized Cells And Enzymes (IRL 
Press, 1986); B. Perbal, A Practical Guide To Molecular Cloning (1984); the treatise, 
Methods In Enzymology (Academic Press, Inc., N.Y.); Immunochemical Methods In Cell 

30 And Molecular Biology (Mayer and Walker, eds., Academic Press, London, 1987); Handbook 
Of Experimental Immunology, Volumes I-IV (D. M. Weir and C. C. Blackwell, eds., 1986). 
Reagents, cloning vectors, and kits for genetic manipulation referred to in this disclosure are 
available from commercial vendors such as BioRad, Stratagene, Invitrogen, and Clontech. 
General techniques in cell culture and media collection are outlined in Large Scale 
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Mammalian Cell Culture (Hu et al, Curr. Opin. Biotechnol. 8 (1997), 148); Serum-free 
Media (Kitano, Biotechnology 17 (1991), 73); Large Scale Mammalian Cell Culture (Curr. 
Opin. Biotechnol. 2 (1991), 375); and Suspension Culture of Mammalian Cells (Birch et al., 
Bioprocess Technol. 19 (1990), 251). Other observations about the media and their impact on 
the culture environment have been made by Marshall McLuhan and Fred Allen. 
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EXAMPLES 



Example 1: Generation of a vector for cardiac-specific SEAP expression 

Using standard cloning procedures, a SEAP expression vector was obtained in the following 
way. A 5.4 kb fragment containing the promoter region of the ct-mouse myosin heavy chain 
gene (Genbank No. U71441) was inserted into the polylinker of the pEGFP-1 vector 
(Clontech). The a-myosin heavy chain gene is specifically expressed by cardiac myocytes in 
differentiating ES cells (Gulick et al., J. Biol. Chem. 266 (1991), 9180-91855; Palermo et al., 
Cell. Mol. Biol. Res. 41 (1995), 501-519; Morkin, Microsc. Res. Tech. 50 (2000), 522-531; 
Boheler et al., Circ. Res. 91 (2002), 189-201). Subsequently, the EGFP encoding sequence 
was removed and replaced by a SEAP encoding sequence (Berger et al., Gene 66 (1988), 1- 
10). 

The resulting vector, designated a-MHC-SEAP, thus contains the SEAP gene under the 
15 transcriptional control of a heart-specific promoter. Further features of the vector include the 
presence of a Kozak consensus sequence to allow efficient translation of the SEAP mRNA, 
the presence of SV40 polyadenylation signals directing proper processing of the SEAP 
mRNA, the presence of a neomycin resistance cassette consisting of the SV40 early promoter, 
the neomycin/kanamycin resistance gene of Tn5 and polyadenylation signals from the Herpes 
20 simplex virus thymidine kinase gene to allow selection of stably transfected ES cells, and the 
presence of a bacterial promoter upstream of this cassette to confer kanamycin resistance in E. 
coli. 
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Example 2: Generation of ES cell transfectants 

The a-MHC-SEAP vector was linearized by restriction endonuclease digestion and 
introduced into D3 mouse embryonic stem (ES) cells (American Type Culture Collection) by 
electroporation. Following neomycin selection, single ES cell colonies were isolated and 
expanded. ES cell culture and transfection as well as selection of stable transfectants was 
done using standard procedures (Torres and Kuhn, Laboratory protocols for conditional gene 
30 targeting. Oxford University Press, (1 997) New York). 
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Example 3: Differentiation of ES cells into cardiac cells in the presence of retinoic acid 

The presence of excessive retinoic acid has striking effects on early heart development in vivo 
(Osmond et al., Development 1 13 (1991), 1405-1417; Yutzey et al., Development 120 (1994), 
871-883; Dickman and Smith, Dev. Dyn. 206 (1996), 39-48; Drysdale et al., Dev. Biol. 188 

5 (1997), 205-215). Furthermore, ail-trans retinoic acid (CAS No. 302-79-4) was shown to 
inhibit the generation of cardiomyocytes from ES cells in vitro (Seiler et al. ALTEX 19 
Suppl. 1 (2002), 55-63). Therefore, ES cells stably transfected with the a-MHC-SEAP vector 
were treated with retinoic acid at different concentrations and subjected to a differentiation 
protocol that resulted in the generation of cardiac cells. If secreted reporter, here SEAP, 

10 activity can indeed be used to quantify ES cell-derived cardiomyocytes, the amount of SEAP 
activity in culture supernatants should correlate inversely with the concentrations of retinoic 
acid present in the cultures. 

All incubations of cell cultures were performed at 37°C in a humidified atmosphere 
containing 5% CO2. 

15 

ES cell aggregation 

ES cells stably transfected with the a-MHC-SEAP vector (see Example 2) were trypsinised to 
obtain a single cell suspension. Cells were collected by centrifugation (800 g for 5 min) and 
resuspended in Iscove's Modified Dulbecco's Medium (IMDM, Invitrogen) supplemented 

20 with 20% (v/v) fetal bovine serum (FBS, Invitrogen) at a density of 2xl0 6 cells/ml. To 
aggregate ES cells, 4 ml of this suspension were incubated on a 6 cm bacteriological dish 
(Greiner) on a rocking platform (GFL 3006) with agitation (50 rpm). After 6 hours, 2 ml of 
that culture were added to 13 ml IMDM containing 20% FBS in a T25 flask (Falcon). The 
flask was incubated with rocking (50 rpm; GFL 3006) for 18 hours. During this procedure ES 

25 cell aggregates (embryoid bodies; EBs) were formed. 

ES cell differentiation in the presence of re tinoic acid 

All-trans retinoic acid (Sigma) was dissolved in dimethyl sulfoxide (DMSO, Sigma) to obtain 
10" 4 M, 10" 5 M, 10" 6 M, 10" 17 M, 10" 8 M, and 10" 9 M stock solutions. In 10 cm petri dishes 
30 (bacteriological grade, Greiner) medium was prepared consisting of IMDM supplemented 
with 20% FBS and retinoic acid added from above stocks to yield final concentrations of 10 -7 
M, 10" 8 M, 10- 9 M, 10" 10 M, 10" n M, and 10" 12 M retinoic acid, respectively. One dish 
received DMSO (final concentration 0.1% (v/v)) without retinoic acid (solvent control). Each 
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dish contained 10 ml of medium. 70 EBs were transferred into each petri dish and incubated 
for 4 days. 

Subsequently, EBs were transferred to 6-well plates (tissue culture grade; Becton-Dickinson). 

5 Each well contained 3 ml IMDM with 20% FBS and retinoic acid and DMSO as described 
above for the culture in 10 cm dishes. 2 wells were set up for each concentration of retinoic 
acid and 2 wells as solvent controls. To each well 20 EBs were transferred from the respective 
10 cm dish. Plates were incubated for 7 days. Subsequently, culture medium was exchanged 
with 2 ml knockout™ D-MEM (Invitrogen) supplemented*™* 20% (v/v) knockout™ SR 

10 (Invitrogen). Samples of cell culture supernatants were collected after incubation of EBs for 
further two days. 
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n^tPrmination of SF-AP activity 

Samples were assayed for SEAP activity using the Great EscAPe SEAP Fluorescence 
Detection Kit (Becton-Dickinson) following the manufacturer's instructions. Fluorescence 
was measured using a microplate reader (Safire, Tecan). Background activity was determined 
by assaying a sample of fresh medium (knockout™ D-MEM supplemented with 20% (v/v) 
knockout™ SR). 

Results are shown in Fig. 1. Concentrations of retinoic acid below 10" 9 M had no influence on 
SEAP activity as compared to controls. SEAP activity decreased sharply at 10' 8 M retinoic 
acid. At higher concentrations no activity was observed. 

Thus, a concentration-dependent effect of retinoic acid on SEAP activity in culture 
25 supernatants is clearly observed. Moreover, these data are consistent with the concentration- 
dependent inhibition of cardiomyocyte generation by retinoic acid described previously 
(Seiler et al., 2002, supra). Therefore, the results indicate the suitability of the reporter system 
presented here to quantify ES cell derived cardiomyocytes and its usefulness for the 
monitoring of the differentiation process of ES cells 

30 
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SEQUENCE LISTING 

pro - ?*::r:r"i 
Q 8. juii 2233 

5 <110> Axiogenesis AG 



<120> Secreted Proteins As Markers For Cell Differentiation 

10 

<130> AX02A03/P-EP 

15 

<160> 4 
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<170> Patentln version 3,1 



25 <2io> l 

<211> 5443 

<212> DNA 

<213> Mus musculus 
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35 <220> 

<221> promoter 

<222> (1) . . (5443) 

<223> alpha myosin heavy chain gene, promoter region 
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45 <220> 

<221> promoter 

<222> (2).. (5442) 

<223> promoter region contained in SEQ ID NO 3 



50 



55 <220> 

<221> 5'UTR 

<222> (4369) . . (4389) 

60 

<223> exonl 
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<220> 
5 <221> 5'UTR 

<222> (5071) . . (5139) 
<223> exon 2 

10 

<220> 
15 <221> 5'UTR 

<222> (5431) . . (5443) 
<223> exon 3 

20 



ggatcctgca aggtcacaca agggtctcca cccaccaggt gccctagtct caatttcagt 60 
25 ttccatgcct tgttctcaca atgctggcct ccccagagct aatttggact ttgtttttat 120 
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180 
240 
300 

agggtgggga ggtggggctt ggaagaagaa ggtggggaag tggcaaaagc cgatccctag 360 
35 . , 4 fo^ a ^irr. ttcctccaacc 420 



ttcaaaaggg cctgaatgag gagtagatct tgtgctacco agctctaagg gtgcccgtga 
agccctcaga cctggagcct ttgcaacagc cctttaggtg gaagcagaat aaagcaattt 
tccttaaagc caaaatcctg cctctagact cttcttctct gacctcggtc cctgggctct 



40 



50 



60 



ggccctgtga agttcggagc cttccctgta oagcaotggc tcatagatcc tcctccagcc 
aaacatagca agaagtgata cctcctttgt gacttcccca ggcccagtac ctgtcaggtt 
gaaacaggat ttagagaagc ctctgaactc acctgaactc tgaagctcat ccaccaagca 
agcacctagg tgccactgct agttagtatc ctacgctgat aatatgcaga gctgggccac 
agaagtcctg gggtgtagga actgaccagt gacttttcag tcggcaaagg tatgaccccc 
tcagcagatg tagtaatgtc cccttagatc ccatcccagg caggtctcta agaggacatg 
ggatgagaga tgtagtcatg tggcattcca aacacagcta tccacagtgt cccttgcccc 
ttcoacttag ccaggaggac agtaacotta gcctatcttt cttcctcccc atcctcccag 
gacacacccc ctggtctgca gtattcattt cttccttcac gtcccctctg tgacttccat 
ttgcaaggct tttgacctct gcagctgctg gaagatagag tttggcccta ggtgtggcaa 
gccatctcaa gagaaagcag acaacagggg gaccagattt tggaaggatc aggaactaaa 
tcactggcgg gcctgggggt agaaaaaaga gtgagtgagt ccgctccago taagccaagc 
tagtccccga gatactctgc cacagctggg ctgctcgggg tagctttagg aatgtgggtc 
tgaaagacaa tgggattgga agacatctct ttgagtctcc octcaacccc acctacagac 



900 
960 
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acactcgtgt gtggccagac tcctgttcaa cagccctctg tgttctgacc actgagctag 
gcaaccagag catgggccct gtgctgagga tgaagagttg gttaccaata gcaaaaacag 
caggggaggg agaacagaga acgaaataag gaaggaagaa ggaaaggcca gtcaatcaga 
tgcagtcaga agagatggga agccaacaca cagcttgagc agaggaaaca gaaaagggag 
agattctggg cataaggagg ccacagaaag aagagcccag gccccccaag tctcctcttt 
ataccctcat cccgtctccc aattaagccc actcttcttc ctagatcaga cctgagctgc 
agcgaagaga cccgtaggga ggatcacact ggatgaagga gatgtgtgga gaagtccagg 
gcaacctaag agccagagcc taaaagagca agagataaag gtgcttcaaa ggtggccagg 
ctgtgcacac agagggtcga ggactggtgg tagagcctca agataaggat gatgctcaga 
atgggcgggg ggggggattc tggggggggg agagagaagg tgagaaggag cctggaacag 
agaatctgga agcgctggaa acgataccat aaagggaaga acccaggcta cctttagatg 
taaatcatga aagacaggga gaagggaagc tggagagagt agaaggaccc cggggcaaga 
catggaagca aggacaagcc aggttgagcg ctccgtgaaa tcagcctgct gaaggcagag 
ccctggtatg agcaccagaa cagcagaggc tagggttaat gtcgagacag ggaacagaag 
gtagacacag gaacagacag agacggggga gccaggtaac aaaggaatgg tccttctcac 
ctgtggccag agcgtccatc tgtgtccaca tactctagaa tgttcatcag actgcagggc 
tggcttggga ggcagctgga aagagtatgt gagagccagg ggagacaagg gggcctagga 
aaggaagaag agggcaaacc aggccacaca agagggcaga gcccagaact gagttaactc 
cttccttgtt gcatcttcca taggaggcag tgggaactct gtgaccacca tcccccatga 
gcccccacta cccataccaa gtttggcctg agtggcattc taggttccct gaggacagag 
cctggccttt gtctcttgga cctgacccaa gctgacccaa tgttctcagt accttatcat 
gccctcaaga gcttgagaac caggcagtga catattaggc catgggctaa ccctggagct 
tgcacacagg agcctcaagt gacctccagg gacacagctg cagacaggtg gcctttatcc 
ccaaagagca accatttggc ataggtggct gcaaatggga atgcaaggtt gaatcaggtc 
ccttcaagaa tactgcatgc aagacctaag acccctggag agaggggtat gctcctgccc 
ccacccacca taaggggagt gaactatcct agggggctgg cgaccttggg gagacaccac 
attactgaga gtgctgagcc cagaaaaact gaccgccctg tgtcctgccc acctccacac 
tctagagcta tattgagagg tgacagtaga tagggtggga gctggtagca gggagagtgt 
tcctgggtgt gagggtgtag gggaaagcca gagcagggga gtctggcttt gtctcctgaa 
cacaatgtct acttagttat aacaggcatg acctgctaaa gacccaacat ctacgacctc 
tgaaaagaca gcagccctgg aggacagggg ttgtctctga gccttgggtg cttgatggtg 
ccacaaagga gggcatgagt gtgagtataa ggccccagga gcgttagaga agggcacttg 



1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 



ggaaggggtc agtctgcaga gcccctatcc 
taaatctctg ggcctgccag gcattcaaag 

5 

ggcggaaggg agcaaccccc cacttatacc 
ctggccttcc cccttcccac ctcccatcag 
10 cctacatgtc caaacatcat ggtgcacgat 
ggaaatctgc aggcttaact gggttaatgt 
actgaaaacg ggcatggctg tgcagctgtt 

15 

ggtttgtgtg taaattgccc aaggcaaagt 
tggatgatgg cctgcatctc aaggaccatg 
20 ctaagctaag gtagcaaggt ctttggagga 
tacagacagt atctgtacag agtaaggaga 
tcaaagggaa actgagtcgt gcacctgcaa 

25 

tgtctctggg gagccagcac tgtggaactt 
gcctgaagct atgcagatag ccagggttga 
30 ttgtgtgttg gaggcagggg acagatatta 
ttgttcaact cacccttcag attaaaaata 
gtgtgagacg ctcctgtctc tcctctatct 

35 

gcccaaggac taaaaaaagg ccatggagcc 
gcaaaccttg gggccctgct gtcctcctgt 
40 ggagccagga caggagggaa gtgggaggga 
agcaggcata tgggatggga tataaagggg 
ccaacccagg taagagggag tttcgggtgg 

45 

acctagaagg aaactgcctt tcctggaagt 
gtggccttcc accagcctgg gaagttctca 
50 atgccccttc ccttacgctg tcttctccat 
ggtttatctt ggctcttcgt cttcagcaag 
tctactgtct ccgtgccttg ccttgccttc 

55 

acttcacctt ttctcccctt ctcatttgta 
cttccttcct tccttccttc ctttctccct 
60 cttccttcct gtgtcagagt gctgagaatc 
atcttccagt gagccacagc ttcagtgctg 
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atggaatctg 


gagcctgggg 


ccaactggtg 


3180 


cagcacctgc 


atcctctggc 


agcctgggga 


3240 


ctttctccct 


cagccccagg 


attaacacct 


3300 


gagtggaggg 


ttgcagaggg 


agggtaaaaa 


3360 


atatggatca 


gtatgtgtag 


aggcaagaaa 


3420 


gtaaagtctg 


tgtgcatgtg 


tgtgtgtctg 


3480 


cagttctgtg 


cgtgaggtta 


ccagactgca 


3540 


gggtgaatcc 


cttccatggt 


ttaaagagat 


3600 


gaaaatagaa 


tggacactct 


atatgtgtct 


3660 


cacctgtcta 


gagatgtggg 


caacagagac 


3720 


gagaggaggg 


ggtgtagaat 


tctcttacta 


3780 


agtggatgct 


ctccctagac 


atcatgactt 


3840 


caggtctgag 


agagtaggag 


gctcccctca 


3900 


aagggggaag 


ggagagcctg 


ggatgggagc 


3960 


agcctggaag 


agaaggtgac 


ccttacccag 


4020 


actgaggtaa 


gggcctgggt 


aggggaggtg 


4080 


gcccatcggc 


cctttgggga 


ggaggaatgt 


4140 


agaggggcga 


gggcaacaga 


cctttcatgg 


4200 


cacctccaga 


gccaagggat 


caaaggagga 


4260 


gggtcccagc 


agaggactcc 


aaatttaggc 


4320 


ctggagcact 


gagagctgtc 


agagatttct 


4380 


gggctcttca 


cccacaccag 


acctctcccc 


4440 


ggggttcagg 


ccggtcagag 


atctgacagg 


4500 


gtggcaggag 


gtttccacaa 


gaaacactgg 


4560 


cttcctcctg 


gggatgctcc 


tccccgtctt 


4620 


atttgccctg 


tgctgtccac 


tccatctttc 


4680 


ttgcgtgtcc 


ttcctttcca 


cccatttctc 


4740 


ttcatccttc 


cttccttcct 


tccttccttc 


4800 


tccttccttc 


cttccttcct 


tccttccttc 


4860 


acacctgggg 


ttcccaccct 


tatgtaaaca 


4920 


ctgggtgctc 


tcttaccttc 


ctcaccccct 


4980 
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ggcttgtcct gttccatcct ggtcaggatc tctagattgg tctcccagcc tctgctactc 
ctcttcctgc ctgttcctct ctctgtccag ctgcgccact gtggtgcctc gttccagctg 
tggtccacat tcttcaggat tctctgaaaa gttaaccagg tgagaatgtt tcccctgtag 
acagcagatc acgattctcc cggaagtcag gcttccagcc ctctctttct ctgcccagct 
gcccggcact cttagcaaac ctcaggcacc cttaccccac atagacctct gacagagaag 



5040 
5100 
5160 
5220 
5280 



10 caggcacttt acatggagtc ctggtgggag agccataggc tacggtgtaa aagaggcagg 5340 

gaagtggtgg tgtaggaaag tcaggacttc acatagaagc ctagcccaca ccagaaatga 
15 cagacagatc cctcctatct cccccataag agtttgagtc gac 

<210> 2 
20 <2X1> 6298 
<212> DNA 
<213> Mus musculus 

25 

<220> 

30 <221> promoter 

<222> (1)..(5445) 

<223> ventricular myosin regulatory light chain gene (AF302688) 

35 

<220> 
40 <221> 5'UTR 

<222> (5446) . . (5486) 
<223> exon 1 

45 

<220> 
50 <221> exon 

<222> (5487) . . (5489) 
<223> exon 1 

55 



5400 
5443 



60 



<220> 

<221> exon 

<222> (6209) (6298) 



50 



<223> exon 2 

5 <400> 2 

ctcgagtgct gggattaaag gcatgcacca 

ttaatctttt taattatgtg tatgtatgtc 

10 tgccctggaa gaccagaagg gggtgacaaa 

tgctgggaat caaacttaat ttctctagaa 

ttgccagccc ttattttgtt tggttggttg 

15 

agtcttttat actctgccct catggcttct 
ttatcttcag aaagcaattg ttactttgac 
20 ctgaggctag aagaaaatgt accctccttc 
caggtaaagg tagcaggtcc aggttcagat 
cttcacttcc aagcttgccc cacctgatca 

25 

gcaggctctt caccccaaca ggagtcccca 
ccttatgaca accctgcgaa taaaatcgtc 
30 aagctgagag caaaaaccta cctgatgtca 
tcaaaagtga atattcccgc aggctccagt 
acatttgggt ccatgttctg aaatcctctg 

35 

ctaaatagct gagtccattc acctggccct 
ttcaggattc ttcaaacttg ttctgagtag 
40 ctcagcagaa agcccccatg tggcacaggc 
tcacaactgc acttcctctc atgccgaaga 
ggaaccatcg gatcctgtaa cccgcttctc 

45 

ctccctccgt ctgctctctt ctgcactgca 
gtttttgttt ggcttgtttc tttgcttgcc 
50 tagcaaaggc atttagtggg tcaaccaagc 
caaggtcatc ctttgctaca caatgagttc 
ctaaaataaa caaaaaataa aataattgat 

55 

tagactgttg gggatacatt ctccaagagt 
tgttgttact ataacacaac actagagtct 
60 tgggctggag agatggctca gagctgactg 
gcaaccacat ggtggctcga aaccatctgt 



ccatgctcag 


cctgttttct 


tttttaagag 


60 


tttgcatgga 


tatgcccacg 


agagtgcatg 


120 


cccctgcagt 


tttgagcctc 


ctggcctggg 


180 


gagcagagtc 


ctcttaacac 


ccaagccctc 


240 


gtttttgcca 


aagctaggag 


ttccagcccc 


300 


gacatctcca 


actctgccag 


gggctgactc 


360 


ccccagcagt 


gctgagccac 


cacccagggc 


420 


catggtcatc 


ttcagaaagc 


actgtgaacc 


480 


gtagtgtcca 


cagcctgtgc 


aaaaaggacc 


540 


gaccactctc 


cagatctggg 


ggaaacggta 


600 


gggcacacaa 


gatattgcct 


ttgattttgt 


660 


atcatgccca 


ctctacagat 


tagaatactg 


720 


cactgctgct 


caatgactaa 


gcaacgaaaa 


780 


cttgcagatg 


acaaatgctc 


tgtcaccgtc 


840 


ttttcacctc 


cctagaaccc 


cagatagtac 


900 


ggctttctgc 


taaggaatat 


gtcagacatc 


960 


aaatctcttc 


aggcctggga 


gctgtttcct 


1020 


caactccccc 


acaaccagaa 


ccatcatcag 


1080 


aacttttttt 


ttcagtcttg 


gatggatgct 


1140 


cacctccagt 


acagacgtca 


ggacttgagg 


1200 


cttgaatttc 


cctgaggttt 


ggcatttggg 


1260 


tagaggaata atcaatgctt 


aggcttttaa 


1320 


actcaggaga 


tagacaggag 


ggtcagaatt 


1380 


agggtcagcc tgggctacac 


gagatccaat 


1440 


atttcttctc tgtatattat 


agttatactg 


1500 


ctcaaagttc 


atctccactg 


agttaattaa 


1560 


ggttcgttta 


caaatacaaa 


agatttattt 


1620 


ctcttccaaa 


ggtcctgagt 


tccaatccca 


1680 


aatgggatct 


gatgccttct 


tctagtgtgt 


1740 
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ctgaagacag 


ctaiiggtgta 


ctcacataca 


taaaataaat 


aaataaaiict 


uaagagagag 


JLoUU 




atxtgtttct: 


acaaaggcta 


gatgctactg 


agagtaaagg 


gtttagaact 


ccgutttcct 


1 OCA 

I860 




cttcttacta 


actcactaat 


cccatcttgg 


gggggggagg 


gggaagcttc 


atgacttcac 


1920 




ctaacctcgt 


tcacctctcc 


tagaccttac 


ctcctatcgc 


taccaatctc 


tgatttgggg 


1980 


10 


atcaaatttt 


caacccagga 


gctttaggga 


gacaaatgta 


aaccccagta 


tcccctgact 


204 0 


cccggtgtac 


actggatcct 


gcacacagta 


gggctccaat 


aaatactgaa 


agtccttaga 


Z10U 




atgggggcaa 


aatcattttt 


gtgtttataa 


atatgaccca 


catgttctct 


catcttttaa 




1 c 
I J 


aattgtcaga 


gtaatttctc 


tctctctctc 


tctctctctc 


tctgtgtgtg 


tgtgtgtgtc 


2220 




tgtgtctgtg 


tgtgtgtctg 


tgtctgtgtc 


tgtgtgtctg 


tgtgtgtgta 


ctcacatcca 


2280 


20 


agtgggtttg 


ggtacctgtc 


atgtggacat 


gtatatgttg 


gtagaagcca 


gaagtcaacc 


2340 


ttgtgtgtca 


tttctcagag 


gtgatccatc 


ttattttttg 


agacagggtc 


tctcactgag 


2400 




gccggggact 


tactgttttt 


aggttacacc 


tgctggccag 


caagccagag 


agaccgagag 


2460 


K 
/.J 


agacggtctg 


tctccatctc 


tccagcgaca 


ggctttcaga 


cccacaccac 


catgcccagc 


2520 




tttgatgtgg 


gtctgggttt 


tgaggagtag 


aattcaggac 


ctcttgctga 


gccatctccc 


2580 


30 


cagccactga 


acataattca 


tatataatct 


ggcttttgtt 


ccttttgtgg 


gtgcagggtg 


2640 


tgttttgttt 


tgttctttgc 


tgtgtattat 


tctgtgcatt 


taagcttttt 


ttttttgtga 


2700 




ctacctggga 


gggacactga 


acagccagaa 


aggccaggcc 


gaggcttcac 


tgatggggat 


2760 


IS 
JJ 


gtgagcccgg 


gaatgctggc 


agctgccggg 


agctggaaag 


ggcaaaggaa 


aggactgtct 


2820 




ccgcatccac 


ggtggacgca 


gccctctgcc 


cgatttctat 


ttctggccac 


cagctctgcc 


2880 


40 


aggtagcgag 


cttggctgct 


ctgagctgcc 


tgggtttgcc 


gttgtatttt 


cctactagca 


2940 


tgggaaagcg 


tgatcagctt 


gctttgttct 


caattgttcc 


agaagctctg 


ccggtcccct 


3000 




ccaggactcc 


tgagtctctg 


ctccgtggta 


ctcggggctg 


gctctcaaag 


ttctaggctg 


3060 




cagaaatctc 


acaagcgcat 


gagcttagag 


tcccagggtg 


gatgtccacc 


aagagcccag 


3120 




ggacaaagca 


fctgacagctc 


ctgtgtgcgc 


cacgtctccc 


cccaccccca 


cccctacccc 


3180 


50 


caggaactgt 


gagaggagtg 


cagagcccct 


cccccaggcc 


tttccaacaa 


ggactcctgg 




aggacccttg 


ggtttttaac 


accaaatcac 


caaatgtttc 


ccacgcagca 


acacaaacca 


O *3 Art 

3300 




gctcttcctc 


atacagcacg 


gtgggccagt 


ggaccatggg 


gacaggttac 


ctctgtgggc 


336U 


55 


ccaggctcac 


ggtaaactct 


aacctcaatc 


tgtagcctcc 


cacagccatt 


tgcgggtcac 


3420 




cttgcttctc 


agccaccgtg 


tggcacttgg 


caagtcacgt 


gtgcctcaac 


acaataagaa 


3480 


60 


gccaagggaa 


taggggcttt 


gcttaactgg 


tacagcagtg 


tagcccaagc 


tagccttgaa 


3540 


ctcactatgt 


agccaaggac 


gatcataaac 


tcttgatcct 


cccgcttcag 


agtcctgggt 


3600 




gttgagataa 


caggtgtggg 


tcactcccta 


cccttcttct 


aatagcaatc 


aatgtgtggc 


3660 
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cacatgtttg tgcctcacag attaaaacca tcttgacctg aggacgaaat gactaacagt 

tgcctcctga aggttgcctg gatctcatct ttataatccc agcaatcaag gggagtgggg 

gatcaggagt tcaaagtcag cccagcctgg gctacatgag accctgtctt gaaaaatgga 

ggaattaagc tgggcgtggt gccgcactcc tttaatccca gcacttggga ggtagaggca 

ggcggacttc tgagttcgag gccagcctgg tctacaaagt gagttccagg acagccacag 

ctatacagag aaaccctgtc tcgaaaaacc agaaagaaag aaagagagaa agaaagaaag 

aaagaaagaa agaaagaaag aaagaaagga aggaaggaag gaaggaagga aaggaaggaa 

ggaaggaagg aaagaaagaa agaaagaaag aaagaaagac agacagaaag aaaggttagg 

aaagaaagaa aggaaaagaa agaaagaaag aaagaaagaa agaaagaaag aaagaaagaa 

agacagaaag aaaggtttgg aaagaaagaa aggaaaagaa agaaagaaag agagaaagag 

agaaagaaag aaagaaaaga aaagagagaa aagaaaagaa aagaaaagaa aagaaagaaa 

agaaaaaaaa gaaaagaaaa gaaagaaaag aaaaggaagg aagaaaagaa aagaaaaatg 

gaggagttaa ccctatgttt cctttttttt tattcatcat tggtgaggct atcctcagct 

acatatcaag ttcaagccag cctgggctac atgagaccct gcctcaaaaa agaaaaggag 

ccagtgtagc gacatactcc cgtoctccca gcacttggga gacagaggct actccactgc 

tgtctccagc agccggcctg cctccctgag cctcattttt ttcataacat ggggacccaa 

ctgctaaggt gaccttgctc ccatggggtg actggagact tgagagtgaa gtggttatca 

tttgtccagt ctgtgaacaa atggcagcct ccaaggtggt tttgtgttca aaggaggaca 

tgggacaggg agaggccagg gagaagagcc caccctcagg agtaggctgt ccccgtgaag 

ctgggtgggg acaaaaagca gagaagcaga ggcagaggac aagcgtgggt gacatttgag 

caaagatggg aatgtgccag aggctgccca agatgtgcat gtgcaaaggc cctgaggtgc 

aagggtgcct ggatccagag ccaaaagctc aggctccctc ctcctcttcc tcctcttcct 

cctcttcctc ctctttctcc tctccctcct ctccctcctc ccccttcctc ttcctctccc 

cttcctcctc ttccttctcc ccttccttct cctccccttc ctcctcctcc tcctcctgct 

cctcctcctc ctcctcctgc tcctcctcct gctcctcctc ctcctcctcc tgggttactc 

ttccccatta gacaatggca gggaagagag cacaccccat catccccagg ccaggcccca 

gccactgact ctttaacctt gaaggcattt ttgggtctca cgtgtccacc caggcgggtg 

gccgcctttg agcagctctt acttcagaag aacggcatgg agtggggggt ggggggctta 

ggtggcctcc gcctcaccta caactgccaa aagtggtcat ggggttattt ttaaccccag 

gggagaggta tttattgttc cacagcaggg gcagaggcca gcaggctcct cgaactctcc 

agaggtggca actggcctca gacacc atg gtgagtggtc agtaaaccct tgagaagaga 

Met 



3720 

3780 

3840 

3900 

3960 

4020 

4080 

4140 

4200 

4260 

4320 

4380 

4440 

4500 

4560 

4620 

4680 

4740 

4800 

4860 

4920 

4980 

5040 

5100 

5160 

5220 

5280 

5340 

5400 

5460 

5519 



53 



l 



cgcagggtgg ggagcagaga gataaccccc tcccaagcca agcaccccat ggaggagggg 

gggaggagga ggggaaggag gagggaagct ctcttacgag ccccctagcc ctagatggac 

cagcaccttg caccctctga gggaccccaa tcagctcccc taaggagcca caaatagcag 

ctcctcaagg aacttgcaaa aatcaatgag aaatgcgctt gggggatggg tgcccactac 

ctgatctcaa gaaatcagta acaccccacc cccaccccac ccagagcttg ccaaacggag 

actgagagct tttaagggtc gaattgtaat tcttttccca attcaggtgg ccaggaagag 

gttttctggt tctctctttt gaatattccc cttgaaatat ttgtgcccgc ctcccagaac 

aggtagcccc cagctgctag agactgcagc taaggggcgc agagtgtacg tgtgtgtggc 

tgtgtgatct agagaagtga ctcaccctct ctgagcctcc agtctcctta gtggagcaga 

ggagagcatt agataatgtt tggaggtttt ggggtatcat ttcgcctcgc atgttgtctg 

ggtaccagag actcactccc caggtgacag gtcctggccc aggtcctgat ccagaggctc 

cacagtgtct gatggatatt cctctccag gca cca aag aaa gcc aag aag egg 
y * Ala Pro Lys Lys Ala Lys Lys Arg 

5 

ata gaa ggc ggg age tec aac gtg ttc tec atg ttt gag cag ace cag 
lie llu Gly Giy Ser Ser Asn Val Phe Ser Met Phe Glu Gin Thr Gin 

20 " 



10 15 

ate cag gag ttc aag gaa 
lie Gin Glu Phe Lys Glu 
30 



<210> 3 

<211> 10475 

<212> DNA 

<213> Artificial Sequence 



5579 

5639 

5699 

5759 

5819 

5879 

5939 

5999 

6059 

6119 

6179 

6232 

6280 

6298 



<220> 

<223> composite vector 
<220> 

<221> promoter 

<222> (77) . . (5517) 

<223> 77-5517 Mus musculus alpha myosin heavy chain gene, promoter regi 
on (Acc. No. 0*71441) 
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<220> 

<221> CDS 
5 <222> (5558) . . (7117) 
<223> SEAP 

10 

<220> 

<221> rep_origin 
15 <222> (9750) .. (10393) 

<223> pUC plasmid replication origin 

20 

<220> 

<221> polyA_signal 
25 <222> (9401) (9406) 

<223> Herpes simplex virus thymidine kinase polyadenylation signal 

30 

<220> 

<221> polyA_signal 
35 <222> (9414) . . (9419) 

<223> Herpes simplex virus thymidine kinase polyadenylation signal 

40 

<220> 

<221> terminator 
45 <222> (9163) (9165) 

<223> Kanamycin /neomycin resistance gene 

50 

<220> 

<221> ATG 
55 <222> (8371) (8373) 

<223> Kanamycin /neomycin resistance gene 



60 

<220> 



55 



<221> promoter 
<222> (8242) . . (8248) 
5 <223> SV4 0 early promoter element 



10 



<220> 

<221> repeat_region 

<222> (8167) . . (8230) 

15 <223> SV40 early promoter 



<220> 

20 

<221> enhancer 
<222> (8018) . . (8163) 
25 <223> SV40 early promoter 



<220> 

30 

<221> rep_origin 

<222> (8187) . . (8322) 

35 <223> SV40 origin of replication 



40 



<220> 

<221> -10_signal 

<222> (7931) • . (7936) 

45 <223> Ampicillin resistance (b-lactamase) promoter 



50 



<220> 

<221> -35_signal 

<222> (7908) . . (7913) 

55 <223> Ampicillin resistance (b-lactamase) promoter 



<220> 

60 

<221> polyA_signal 
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<222> (7294) . . (7299) 

<223> SV40 early mRNA polyadenylation signal 
<220> 

<221> polyA_signal 

<222> (7323) . . (7328) 

<223> SV4 0 early mRNA polyadenylation signal 
<220> 

<221> mature protein 

<222> (5609) . . (7114) 

<223> SEAP 

<220> 

<221> start codon 

<222> (5558) . . (5560) 

<223> ATG of the Secreted alkaline phosphatase (SEAP) gene 
<220> 

<221> sig_peptide 

<222> (5558) . . (5608) 

<223> SEAP 



tagttattac tagcgctacc ggactcagat ctcgagctca agcttcgaat tctgcagtcg 
50 acggtaccgc gggcccgatc ctgcaaggtc acacaagggt ctccacccac caggtgccct 
agtctcaatt tcagtttcca tgccttgttc tcacaatgct ggcctcccca gagctaattt 
ggactttgtt tttatttcaa aagggcctga atgaggagta gatcttgtgc tacccagctc 
taagggtgcc cgtgaagccc tcagacctgg agcctttgca acagcccttt aggtggaagc 
agaataaagc aattttcctt aaagccaaaa tcctgcctct agactcttct tctctgacct 
60 cggtccctgg gctctagggt ggggaggtgg ggcttggaag aagaaggtgg ggaagtggca 
aaagccgatc cctagggccc tgtgaagttc ggagccttcc ctgtacagca ctggctcata 
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60 
120 
180 
240 
300 
360 
420 
480 
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gatcctcctc cagccaaaca 
agtacctgtc aggttgaaac 
ctcatccacc aagcaagcac 
gcagagctgg gccacagaag 
aaaggtatga ccccctcagc 
ctctaagagg acatgggatg 
agtgtccctt gccccttcca 
tccccatcct cccaggacac 
ctctgtgact tccatttgca 
ccctaggtgt ggcaagccat 
ggatcaggaa ctaaatcact 
ccagctaagc caagctagtc 
ttaggaatgt gggtctgaaa 
accccaccta cagacacact 
tgaccactga gctaggcaac 
caatagcaaa aacagcaggg 
ggccagtcaa tcagatgcag 
aaacagaaaa gggagagatt 
ccaagtctcc tctttatacc 
tcagacctga gctgcagcga 
gtggagaagt ccagggcaac 
tcaaaggtgg ccaggctgtg 
aggatgatgc tcagaatggg 
aggagcctgg aacagagaat 
ggctaccttt agatgtaaat 
gaccccgggg caagacatgg 
ctgctgaagg cagagccctg 
gacagggaac agaaggtaga 
aatggtcctt ctcacctgtg 
atcagactgc agggctggct 
caagggggcc taggaaagga 
gaactgagtt aactccttcc 



tagcaagaag tgatacctcc 
aggatttaga gaagcctctg 
ctaggtgcca ctgctagtta 
tcctggggtg taggaactga 
agatgtagta atgtcccctt 
agagatgtag tcatgtggca 
cttagccagg aggacagtaa 
accccctggt ctgcagtatt 
aggcttttga cctctgcagc 
ctcaagagaa agcagacaac 
ggcgggcctg ggggtagaaa 
cccgagatac tctgccacag 
gacaatggga ttggaagaca 
cgtgtgtggc cagactcctg 
cagagcatgg gccctgtgct 
gagggagaac agagaacgaa 
tcagaagaga tgggaagcca 
ctgggcataa ggaggccaca 
ctcatcccgt ctcccaatta 
agagacccgt agggaggatc 
ctaagagcca gagcctaaaa 
cacacagagg gtcgaggact 
cggggggggg gattctgggg 
ctggaagcgc tggaaacgat 
catgaaagac agggagaagg 
aagcaaggac aagccaggtt 
gtatgagcac cagaacagca 
cacaggaaca gacagagacg 
gccagagcgt ccatctgtgt 
tgggaggcag ctggaaagag 
agaagagggc aaaccaggcc 
ttgttgcatc ttccatagga 



tttgtgactt 
aactcacctg 
gtatcctacg 
ccagtgactt 
agatcccatc 
ttccaaacac 
ccttagccta 
catttcttcc 
tgctggaaga 
agggggacca 
aaagagtgag 
ctgggctgct 
tctctttgag 
ttcaacagcc 
gaggatgaag 
ataaggaagg 
acacacagct 
gaaagaagag 
agcccactct 
acactggatg 
gagcaagaga 
ggtggtagag 

ggggg a g a g a 

accataaagg 
gaagctggag 
gagcgctccg 
gaggctaggg 
ggggagccag 
ccacatactc 
tatgtgagag 
acacaagagg 
ggcagtggga 



ccccaggccc 
aactctgaag 
ctgataatat 
ttcagtcggc 
ccaggcaggt 
agctatccac 
tctttcttcc 
ttcacgtccc 
tagagtttgg 
gattttggaa 
tgagtccgct 
cggggtagct 
tctcccctca 
ctctgtgttc 
agttggttac 
aagaaggaaa 
tgagcagagg 
cccaggcccc 
tcttcctaga 
aaggagatgt 
taaaggtgct 
cctcaagata 
gaaggtgaga 
gaagaaccca 
agagtagaag 
tgaaatcagc 
ttaatgtcga 
gtaacaaagg 
tagaatgttc 
ccaggggaga 
gcagagccca 
actctgtgac 



540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
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caccatcccc catgagcccc cactacccat 
tccctgagga cagagcctgg cctttgtctc 

5 

tcagtacctt atcatgccct caagagcttg 
gctaaccctg gagcttgcac acaggagcct 
10 aggtggcctt tatccccaaa gagcaaccat 
aggttgaatc aggtcccttc aagaatactg 
ggtatgctcc tgcccccacc caccataagg 

15 

ttggggagac accacattac tgagagtgct 
tgcccacctc cacactctag agctatattg 
20 tagcagggag agtgttcctg ggtgtgaggg 
gctttgtctc ctgaacacaa tgtctactta 
aacatctacg acctctgaaa agacagcagc 

25 

gggtgcttga tggtgccaca aaggagggca 
agagaagggc acttgggaag gggtcagtct 
30 tggggccaac tggtgtaaat ctctgggcct 
ctggcagcct ggggaggcgg aagggagcaa 
ccaggattaa cacctctggc cttccccctt 

35 

gagggagggt aaaaacctac atgtccaaac 
tgtagaggca agaaaggaaa tctgcaggct 
40 atgtgtgtgt gtctgactga aaacgggcat 
ggttaccaga ctgcaggttt gtgtgtaaat 
atggtttaaa gagattggat gatggcctgc 

45 

actctatatg tgtctctaag ctaaggtagc 
gtgggcaaca gagactacag acagtatctg 
50 agaattctct tactatcaaa gggaaactga 
tagacatcat gactttgtct ctggggagcc 
aggaggctcc cctcagcctg aagctatgca 

55 

gcctgggatg ggagcttgtg tgttggaggc 
gtgaccctta cccagttgtt caactcaccc 
60 tgggtagggg aggtggtgtg agacgctcct 
ggggaggagg aatgtgccca aggactaaaa 



accaagtttg 


gcctgagtgg 


cattctaggt 


2460 


ttggacctga 


cccaagctga 


cccaatgttc 


2520 


agaaccaggc 


agtgacatat 


taggccatgg 


2580 


caagtgacct 


ccagggacac 


agctgcagac 


2640 


ttggcatagg 


tggctgcaaa 


tgggaatgca 


2700 


catgcaagac 


ctaagacccc 


tggagagagg 


2760 


ggagtgaact 


atcctagggg 


gctggcgacc 


2820 


gagcccagaa 


aaactgaccg 


ccctgtgtcc 


2880 


agaggtgaca 


gtagataggg 


tgggagctgg 


2940 


tgtaggggaa 


agccagagca 


ggggagtctg 


3000 


gttataacag 


gcatgacctg 


ctaaagaccc 


3060 


cctggaggac 


aggggttgtc 


tctgagcctt 


3120 


tgagtgtgag 


tataaggccc 


caggagcgtt 


3180 


gcagagcccc 


tatccatgga 


atctggagcc 


3240 


gccaggcatt 


caaagcagca 


cctgcatcct 


3300 


ccccccactt 


ataccctttc 


tccctcagcc 


3360 


cccacctccc 


atcaggagtg 


gagggttgca 


3420 


atcatggtgc 


acgatatatg 


gatcagtatg 


3480 


taactgggtt 


aatgtgtaaa 


gtctgtgtgc 


3540 


ggctgtgcag 


ctgttcagtt 


ctgtgcgtga 


3600 


tgcccaaggc 


aaagtgggtg 


aatcccttcc 


3660 


atctcaagga 


ccatggaaaa 


tagaatggac 


3720 


aaggtctttg 


gaggacacct 


gtctagagat 


3780 


tacagagtaa 


ggagagagag 


gagggggtgt 


3840 


gtcgtgcacc 


tgcaaagtgg 


atgctctccc 


3900 


agcactgtgg 


aacttcaggt 


ctgagagagt 


3960 


gatagccagg 


gttgaaaggg 


ggaagggaga 


4020 


aggggacaga 


tattaagcct 


ggaagagaag 


4080 


ttcagattaa 


aaataactga 


ggtaagggcc 


4140 


gtctctcctc 


tatctgccca 


tcggcccttt 


4200 


aaaggccatg 


gagccagagg 


ggcgagggca 


4260 
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r*r ~% j-* *• +- 4" 

acagaccui-L 


caugggcaaa 


ecu uggggcc 


ct.gcugL.ccu 




cccigczgcccici 




yggatcaaay 


gaggaggagc 


caggacagga 


gggaagrggg 


agggaggg t c 


ccdyLdyayy 




actccaaatt 


taggcagcag 


gcatatggga 


tgggatataa 


aggggctgga 


gcactgagag 


A A a r\ 

4440 


ctgtcagaga 


tttctccaac 


ccaggtaaga 


gggagtttcg 


ggtgggggct 


cttcacccac 


4500 


accagacctc 


tccccaccta 


gaaggaaact 


gcctttcctg 


gaagtggggt 


teaggceggt 


4560 


cagagatctg 


acagggtggc 


cttccaccag 


cctgggaagt 


tctcagtggc 


aggaggtttc 


4 620 


cacaagaaac 


actggatgcc 


ccttccctta 


cgctgtcttc 


tccatcttcc 


tcctggggat 


4680 


gctcctcccc 


gtcttggttt 


atcttggctc 


ttegtcttea 


gcaagatttg 


ccctgtgctg 


4740 


tccactccat 


ctttctctac 


tgtctccgtg 


ccttgccttg 


ccttcttgcg 


tgtccttcct 


4800 


ttccacccat 


ttctcacttc 


accttttctc 


cccttctcat 


ttgtattcat 


ccttccttcc 


4860 


ttccttcctt 


ccttccttcc 


ttccttcctt 


ccttcctttc 


tcccttcctt 


ccttccttcc 


4920 


ttccttcctt 


ccttccttcc 


ttcctgtgtc 


agagtgctga 


gaatcacacc 


tggggttccc 


4980 


acccttatgt 


aaacaatctt 


ccagtgagcc 


acagcttcag 


tgctgctggg 


tgctctctta 


5040 


ccttcctcac 


cccctggctt 


gtcctgttcc 


atcctggtca 


ggatctctag 


attggtctcc 


5100 


cagcctctgc 


tactcctctt 


cctgcctgtt 


cctctctctg 


tccagctgcg 


ccactgtggt 


5160 


gcctcgttcc 


agctgtggtc 


cacattcttc 


aggattctct 


gaaaagttaa 


ccaggtgaga 


5220 


atgtttcccc 


tgtagacagc 


agatcacgat 


tctcccggaa 


gtcaggcttc 


cagccctctc 


5280 


tttctctgcc 


cagctgcccg 


gcactcttag 


caaacctcag 


gcacccttac 


cccacataga 


5340 


cctctgacag 


agaagcaggc 


actttacatg 


gagtcctggt 


gggagageca 


taggctaegg 


5400 


tgtaaaagag 


gcagggaagt 


ggtggtgtag 


gaaagtcagg 


acttcacata 


gaagectage 


5460 


ccacaccaga 


aatgacagac 


agatccctcc 


tatctccccc 


ataagagttt 


gagtcgaggg 


5520 


atctaagcag 


aagcttcgaa 


tcgcgaattc 


gcccacc atg ctg ctg ctg ctg ctg 


5575 



10 



15 



20 



25 



30 



35 



40 



Met Leu Leu Leu Leu Leu 
45 1 5 

ctg ctg ggc ctg agg eta cag etc tec ctg ggc ate ate cca gtt gag 5623 
Leu Leu Gly Leu Arg Leu Gin Leu Ser Leu Gly lie lie Pro Val Glu 
10 15 20 



gag gag aac ccg gac ttc tgg aac cgc gag gca gcc gag gcc ctg ggt 5671 
Glu Glu Asn Pro Asp Phe Trp Asn Arg Glu Ala Ala Glu Ala Leu Gly 
25 30 35 



55 gcc gcc aag aag ctg cag cct gca cag aca gcc gcc aag aac etc ate 5719 

Ala Ala Lys Lys Leu Gin Pro Ala Gin Thr Ala Ala Lys Asn Leu lie 

40 45 50 

ate ttc ctg ggc gat ggg atg ggg gtg tct acg gtg aca get gcc agg 57 67 

60 lie Phe Leu Gly Asp Gly Met Gly Val Ser Thr Val Thr Ala Ala Arg 

55 60 65 70 

ate eta aaa ggg cag aag aag gac aaa ctg ggg cct gag at a ccc ctg 5815 



60 

He Leu Lys Gly Gin Lys Lys Asp Lys Leu Gly Pro Glu He Pro Leu 
75 80 85 

gcc atg gac cgc ttc cca tat gtg get ctg tec aag aca tac aat gta 5863 
5 Ala Met Asp Arg Phe Pro Tyr Val Ala Leu Ser Lys Thr Tyr Asn Val 
90 95 100 

gac aaa cat gtg cca gac agt gga gcc aca gcc acg gcc tac ctg tgc 5911 
Asp Lys His Val Pro Asp Ser Gly Ala Thr Ala Thr Ala Tyr Leu Cys 
10 105 110 115 

ggg gtc aag ggc aac ttc cag acc att ggc ttg agt gca gcc gcc cgc 5959 
Gly Val Lys Gly Asn Phe Gin Thr He Gly Leu Ser Ala Ala Ala Arg 
120 125 130 

15 

ttt aac cag tgc aac acg aca cgc ggc aac gag gtc ate tec gtg atg 6007 
Phe Asn Gin Cys Asn Thr Thr Arg Gly Asn Glu Val He Ser Val Met 
135 140 145 150 

20 aat egg gcc aag aaa gca ggg aag tea gtg gga gtg gta acc acc aca 6055 
Asn Arg Ala Lys Lys Ala Gly Lys Ser Val Gly Val Val Thr Thr Thr 
155 160 165 

cga gtg cag cac gcc teg cca gcc ggc acc tac gcc cac acg gtg aac 6103 
25 Arg Val Gin His Ala Ser Pro Ala Gly Thr Tyr Ala His Thr Val Asn 
170 175 180 

cgc aac tgg tac teg gac gcc gac gtg cct gcc teg gcc cgc cag gag 6151 
Arg Asn Trp Tyr Ser Asp Ala Asp Val Pro Ala Ser Ala Arg Gin Glu 
30 185 190 195 

ggg tgc cag gac ate get acg cag etc ate tec aac atg gac att gac 6199 
Gly Cys Gin Asp He Ala Thr Gin Leu He Ser Asn Met Asp He Asp 
200 205 210 

35 

gtg ate eta ggt gga ggc cga aag tac atg ttt cgc atg gga acc cca 6247 
Val He Leu Gly Gly Gly Arg Lys Tyr Met Phe Arg Met Gly Thr Pro 
215 220 225 230 

40 gac cct gag tac cca gat gac tac age caa ggt ggg acc agg ctg gac 6295 
Asp Pro Glu Tyr Pro Asp Asp Tyr Ser Gin Gly Gly Thr Arg Leu Asp 
235 240 245 

ggg aag aat ctg gtg cag gaa tgg ctg gcg aag cgc cag ggt gcc egg 6343 
45 Gly Lys Asn Leu Val Gin Glu Trp Leu Ala Lys Arg Gin Gly Ala Arg 
250 255 260 

tat gtg tgg aac cgc act gag etc atg cag get tec ctg gac ccg tct 6391 
Tyr Val Trp Asn Arg Thr Glu Leu Met Gin Ala Ser Leu Asp Pro Ser 
50 265 270 275 

gtg acc cat etc atg ggt etc ttt gag cct gga gac atg aaa tac gag 6439 

Val Thr His Leu Met Gly Leu Phe Glu Pro Gly Asp Met Lys Tyr Glu 

280 285 290 

55 

ate cac cga gac tec aca ctg gac ccc tec ctg atg gag atg aca gag 6487 

He His Arg Asp Ser Thr Leu Asp Pro Ser Leu Met Glu Met Thr Glu 

295 300 305 310 

60 get gcc ctg cgc ctg ctg age agg aac ccc cgc ggc ttc ttc etc ttc 6535 
Ala Ala Leu Arg Leu Leu Ser Arg Asn Pro Arg Gly Phe Phe Leu Phe 
315 320 325 



61 
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330 335 

egg gca ctg act gag acg ate atg ttc gae gae gee att gag agg gcg 
Arg Ala Leu Thr Glu Thrlle Met Phe Asp Asp Ala lie Glu Arg Ala 
345 350 

gge eag etc ace age gag gag gae aeg ctg age etc gtc act gee gae 
liy Gin Leu Thr Ser Glu Glu Asp Thr Leu Ser Leu Val Thr Ala Asp 



360 365 



cac tec cac gtc ttc tec ttc gga ggc tac ccc ctg cga ggg age tec 
His Ser His Val Phe Ser Phe Gly Gly Tyr Pre Leu Arg Gly Ser Ser 
15 375 380 385 



»t c ttc aaa ctq gec cct ggc aag gee egg gae agg aag gec tac acg 
111 Phe ITy Leu Ala Pre Gly Lys Ala Arg Asp Arg Lys Ala Tyr Thr 

20 gtc etc eta tac gga aac ggt cea gge tat gtg etc aag gae ggc gec 
val Leu Leu Tyr Gly Asn Gly Pre Gly Tyr Val Leu Lys Asp Gly Ala 
410 415 

25 cac ccg gat gtt acc gag age gag age ggg age ccc gag tat egg cag 
Arg Pre Asp Val Thr Glu Ser Glu Ser Gly Ser Pre Glu Tyr Arg Gin 
425 430 

caa tea qca gtg ccc ctg gae gaa gag acc cac gca ggc gag gae gtg 
Gin Ser 21 Val Pre Leu Asp Glu Glu Thr His Ala Gly Glu Asp Val 
440 44 5 450 

geg gtg ttc gcg cgc gge ccg cag gcg cac ctg gtt cac ggc gtg cag 
111 Val Phe Ala Arg Gly Pro Gin Ala His Leu Val His Gly Val Gin 

460 465 



30 



35 455 



aaa caa acc ttc ata gcg cac gtc atg gec ttc gec gee tgc ctg gag 
55 HI ?hr Phe lie Ala His Val Met Ala Phe Ala Ala Cys Leu Glu 



ccc tac acc gec tgu ^ ~ — " TlnY . Thr Aso Ala 

Pro Tyr Thr Ala Cys Asp Leu Ala Pro Pro Ala Gly Thr Thr Asp Ala 



490 



6583 



6631 



6679 



6727 



6775 



6871 



6919 



6967 



7015 



475 480 
40 ece tac acc gee tgc gae ctg gcg ccc ccc gee gge ace acc gae gee 



45 gcg cac ccg ggt tac tct aga gtc ggg gcg gee ggc cgc ttc gag cag 
X2 His Pre liy Tyr Ser Arg Val Gly Ala Ala Gly Arg Phe Glu Gin 

505 510 
aca tga taagatacat tgatgagttt ggacgcggcc gcgactctag atcataatca 

50 Thr 

gecataccac atttgtagag gttttacttg ctttaaaaaa cctcceaeac ctccecctga 
55 acctgaaaca taaaatgaat gcaattgttg ttgttaactt gtttattgea gcttataatg 
gttacaaata aagcaatagc atcacaaatt tcacaaataa agcatttttt teactgeatt 
ctagttgtgg tttgtccaaa ctcatcaatg tatcttaagg cgtaaattgt aagcgttaat 
60 , . , — . na ffffn- afl r.raataaacc 7467 



7111 

7167 

7227 
7287 
7347 
7407 



attttgttaa aattcgcgtt aaatttttgt taaatcagct cattttttaa ccaataggcc 
gaaateggea aaatccctta taaatcaaaa gaatagaccg agatagggtt gagtgttgtt 



7527 



62 



15 



25 



ccagtttgga acaagagtcc actattaaag aacgtggact ccaacgtcaa agggcgaaaa 7587 

accgtctatc agggcgatgg cccactacgt gaaccatcac cctaatcaag ttttttgggg 7647 

tcgaggtgcc gtaaagcact aaatcggaac cctaaaggga gcccccgatt tagagcttga 7707 

cggggaaagc cggcgaacgt ggcgagaaag gaagggaaga aagcgaaagg agcgggcgct 77 67 

10 agggcgctgg caagtgtagc ggtcacgctg cgcgtaacca ccacacccgc cgcgcttaat 7827 

gcgccgctac agggcgcgtc aggtggcact tttcggggaa atgtgcgcgg aacccctatt 7887 

tgtttatttt tctaaataca ttcaaatatg tatccgctca tgagacaata accctgataa 7 947 

atgcttcaat aatattgaaa aaggaagagt cctgaggcgg aaagaaccag ctgtggaatg 8007 

tgtgtcagtt agggtgtgga aagtccccag gctccccagc aggcagaagt atgcaaagca 8067 

20 tgcatctcaa ttagtcagca accaggtgtg gaaagtcccc aggctcccca gcaggcagaa 8127 

gtatgcaaag catgcatctc aattagtcag caaccatagt cccgccccta actccgccca 8187 

tcccgcccct aactccgccc agttccgccc attctccgcc ccatggctga ctaatttttt 8247 

ttatttatgc agaggccgag gccgcctcgg cctctgagct attccagaag tagtgaggag 8307 

gcttttttgg aggcctaggc ttttgcaaag atcgatcaag agacaggatg aggatcgttt 8367 

30 cgcatgattg aacaagatgg attgcacgca ggttctccgg ccgcttgggt ggagaggcta 8427 

ttcggctatg actgggcaca acagacaatc ggctgctctg atgccgccgt gttccggctg 84 87 

tcagcgcagg ggcgcccggt tctttttgtc aagaccgacc tgtccggtgc cctgaatgaa 8547 

ctgcaagacg aggcagcgcg gctatcgtgg ctggccacga cgggcgttcc ttgcgcagct 8607 
gtgctcgacg ttgtcactga agcgggaagg gactggctgc tattgggcga agtgccgggg 8 667 
40 caggatctcc tgtcatctca ccttgctcct gccgagaaag tatccatcat ggctgatgca 8727 
atgcggcggc tgcatacgct tgatccggct acctgcccat tcgaccacca agcgaaacat 8787 
cgcatcgagc gagcacgtac tcggatggaa gccggtcttg tcgatcagga tgatctggac 8847 
gaagagcatc aggggctcgc gccagccgaa ctgttcgcca ggctcaaggc gagcatgccc 8907 
gacggcgagg atctcgtcgt gacccatggc gatgcctgct tgccgaatat catggtggaa 8967 
50 aatggccgct tttctggatt catcgactgt ggccggctgg gtgtggcgga ccgctatcag 9027 
gacatagcgt tggctacccg tgatattgct gaagagcttg gcggcgaatg ggctgaccgc 9087 
ttcctcgtgc tttacggtat cgccgctccc gattcgcagc gcatcgcctt ctatcgcctt 914 7 
cttgacgagt tcttctgagc gggactctgg ggttcgaaat gaccgaccaa gcgacgccca 9207 
acctgccatc acgagatttc gattccaccg ccgccttcta tgaaaggttg ggcttcggaa 9267 
60 tcgttttccg ggacgccggc tggatgatcc tccagcgcgg ggatctcatg ctggagttct 9327 
tcgcccaccc tagggggagg ctaactgaaa cacggaagga gacaataccg gaaggaaccc 9387 
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crcQcfcafcaac 


aocaat aaaa 


acracacraata 


aaaegcaegg tgttgggtcg tttgttcata 


9447 




aacgcggggfc 


fc caatccrao 


cractcrcrcact 
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ctgtcgatac 


cccaccgaga 


ccccattggg 


9507 
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caccccaccc 


cccaagttcg 


ggtgaaggcc 
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ci y u U d a L. w 


ggggcggcag 


gccctgccat 


agectcaggt 


tactcatata 


y / 
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tgatttaaaa 


cttcattttt 


aatttaaaag 


gatctaggtg 


aagatccttt 


i? DO / 




catgaccaaa 


atcccttaac 


gtgagttttc 


gttccactga 


gcgtcagacc 


Q"7 A "7 




ccatacraaaa 


gatcaaagga 


tcttcttgag 


atcctttttt 


tetgegegta 


atetgetget 


Qon-7 
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tccaaacaaa 


aaaaccaccg ctaccagcgg 


tggtttgttt 


geeggatcaa 


gagctaccaa 


Qft£7 




ctcttti'i'cc 


gaaggtaact 


ggcttcagca 


gagegcagat 


accaaatact 


gtccttctag 
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gttaggecac 


cacttcaaga 


actctgtagc 


accgcctaca 


tacctcgctc 


QQQT 




gttaccagtg 


gctgctgcca 


gtggcgataa 


gtcgtgtctt 


accgggttgg 


1 nn/"7 

xUUfi / 
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atagttaccg 


gataaggege 


ageggteggg 


ctgaacgggg 


ggttcgtgca 
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ccgaactgag 
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lUlD / 




rr.*a *t a 2» a nr^rtr* 
yaya.aayL.yu 
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gaagggagaa 


aggeggacag 
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tcggaacagg 


agagegcacg 


agggagcttc 


cagggggaaa 


cgcctggtat 


ctttatagtc 


10287 


ctgtcgggtt 


tcgccacctc 


tgacttgagc 


gtcgattttt 


gtgatgeteg 


tcaggggggc 


10347 




ggagcctatg 


gaaaaacgee 


ageaacgegg 


cctttttacg 


gttcctggcc 


ttttgctggc 


10407 
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cttttgctca 
ccatgcat 


catgttcttt 


ectgegttat 


cccctgattc 


tgtggataac 


cgtattaccg 
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40 <210> 4 

<211> 519 

<212> PRT 

45 

<213> SEAP 
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<400> 4 

Met Leu Leu Leu Leu Leu Leu Leu Gly Leu Arg Leu Gin Leu Ser Leu 
1 5 10 15 

55 

Gly lie lie Pro Val Glu Glu Glu Asn Pro Asp Phe Trp Asn Arg Glu 
20 25 30 

60 



Ala Ala Glu Ala Leu Gly Ala Ala Lys Lys Leu Gin Pro Ala Gin Thr 
35 40 45 
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Ala Ala Lys Asn Leu He He Phe Leu Gly Asp Gly Met Gly Val Ser 
50 55 60 



Thr Val Thr Ala Ala Arg He Leu Lys Gly Gin Lys Lys Asp Lys Leu 
65 70 75 80 



Gly Pro Glu He Pro Leu Ala Met Asp Arg Phe Pro Tyr Val Ala Leu 
85 90 95 



Ser Lys Thr Tyr Asn Val Asp Lys His Val Pro Asp Ser Gly Ala Thr 
100 105 110 



Ala Thr Ala Tyr Leu Cys Gly Val Lys Gly Asn Phe Gin Thr He Gly 
115 120 125 



Leu Ser Ala Ala Ala Arg Phe Asn Gin Cys Asn Thr Thr Arg Gly Asn 
130 135 140 



Glu Val He Ser Val Met Asn Arg Ala Lys Lys Ala Gly Lys Ser Val 
145 150 155 160 



Gly Val Val Thr Thr Thr Arg Val Gin His Ala Ser Pro Ala Gly Thr 
165 170 175 



Tyr Ala His Thr Val Asn Arg Asn Trp Tyr Ser Asp Ala Asp Val Pro 
180 185 190 



Ala Ser Ala Arg Gin Glu Gly Cys Gin Asp He Ala Thr Gin Leu He 
195 200 205 



Ser Asn Met Asp He Asp Val He Leu Gly Gly Gly Arg Lys Tyr Met 
210 215 220 



Phe Arg Met Gly Thr Pro Asp Pro Glu Tyr Pro Asp Asp Tyr Ser Gin 
225 230 " 235 240 



Gly Gly Thr Arg Leu Asp Gly Lys Asn Leu Val Gin Glu Trp Leu Ala 
245 250 255 



Lys Arg Gin Gly Ala Arg Tyr Val Trp Asn Arg Thr Glu Leu Met Gin 
260 265 270 



Ala Ser Leu Asp Pro Ser Val Thr His Leu Met Gly Leu Phe Glu Pro 
275 280 285 



Gly Asp Met Lys Tyr Glu He His Arg Asp Ser Thr Leu Asp Pro Ser 
290 295 300 
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Leu Met Glu Met Thr Glu Ala Ala Leu Arg Leu Leu Ser Arg Asn Pro 
305 310 315 320 

5 

Arg Gly Phe Phe Leu Phe Val Glu Gly Gly Arg lie Asp His Gly His 
325 330 335 

10 

His Glu Ser Arg Ala Tyr Arg Ala Leu Thr Glu Thr lie Met Phe Asp 
340 345 350 

15 Asp Ala lie Glu Arg Ala Gly Gin Leu Thr Ser Glu Glu Asp Thr Leu 
355 360 365 

Ser Leu Val Thr Ala Asp His Ser His Val Phe Ser Phe Gly Gly Tyr 
20 370 375 380 

Pro Leu Arg Gly Ser Ser lie Phe Gly Leu Ala Pro Gly Lys Ala Arg 
385 ^ 390 395 400 

25 

Asp Arg Lys Ala Tyr Thr Val Leu Leu Tyr Gly Asn Gly Pro Gly Tyr 
405 410 415 

30 

Val Leu Lys Asp Gly Ala Arg Pro Asp Val Thr Glu Ser Glu Ser Gly 
420 425 430 

35 Ser Pro Glu Tyr Arg Gin Gin Ser Ala Val Pro Leu Asp Glu Glu Thr 
435 440 445 

His Ala Gly Glu Asp Val Ala Val Phe Ala Arg Gly Pro Gin Ala His 
40 450 455 460 

Leu Val His Gly Val Gin Glu Gin Thr Phe lie Ala His Val Met Ala 
465 ^ 470 475 480 

45 

Phe Ala Ala Cys Leu Glu Pro Tyr Thr Ala Cys Asp Leu Ala Pro Pro 
485 490 495 

50 

Ala Gly Thr Thr Asp Ala Ala His Pro Gly Tyr Ser Arg Val Gly Ala 
500 505 510 



55 Ala Gly Arg Phe Glu Gin Thr 
515 
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Claims .OflL JU4S 2CJE3 

1 . A method of monitoring cell differentiation comprising: 

(a) culturing cells capable of differentiating into at least one particular cell type 
5 containing at least one recombinant nucleic acid molecule comprising a 

reporter gene encoding a product that is secreted upon cell differentiation, or 
maintaining a non-human animal comprising such cells, under conditions 
allowing differentiation of the cells; and 

(b) determining the amount or activity of the reporter gene product either within a 
1 0 body fluid of said transgenic non-human animal or the cell culture medium. 

2. The method of claim 1, wherein said recombinant nucleic acid molecule comprises 
at least one cell type specific regulatory sequence operably linked to said reporter 
gene. 

15 

3. The method of claim 1 or 2, wherein said cells are or are derived from stem cells 
or primordial germ cells, preferably human or rodent stem cells or primordial germ 
cells. 

20 4. The method of any of claims 1 to 3, wherein said reporter gene product is secreted 

alkaline phosphatase (SEAP) or alpha-amylase. 

5. The method of any one of claims 1 to 4, wherein said cells form embryoid bodies 
(EBs). 

25 

6. A reporter gene construct for monitoring cell differentiation comprising a 
recombinant nucleic acid molecule as defined in any one of claims 1 to 5. 

7. A cell as defined in any one of claims 1 to 5 or comprising a reporter gene 
30 construct of claim 6, wherein said cell is capable of differentiating into at least one 

particular cell type. 



8. 



A method of obtaining and/or profiling a modulator of cell differentiation 
comprising: 
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(a) contacting a test sample comprising a cell of claim 7 with a test substance; 
and 

(b) determining the effect of the test substance on the amount of the reporter 
gene product or activity compared to a control sample. 

A kit useful for conducting a method of any one of claims 1 to 5 or 8, containing 
for example a reporter gene construct of claim 6 or a cell of claim 7, and standard 
compounds, like cell culture media, selection agents, detection agents for the 
reporter molecule and control samples. 

A vector comprising the promoter region of the mouse alpha myosin heavy chain 
gene or of the ventricular myosin regulatory light chain gene, and operably linked 
thereto a heterologous DNA sequence, preferably wherein said promoter 
comprises the nucleotide sequence of SEQ ID NO: 1 or 2, or a fragment thereof. 
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ABSTRACT or 1 

08, Ml 2£33 

Provided are means and methods for in vitro and in vivo detection of chemically induced 
effects on embryonic development and differentiation for the purpose of 
5 embryotoxicity/teratogenicity screening as well as for the identification of pharmaceuticals 
such as growth and tissue promoting factors based on differentiating piuripotent embryonic 
stem (ES) cells. The assays are based on the use of transgenic ES cells and non-human 
animals comprising such ES cells or derivatives thereof, wherein said ES cells are 
characterized by the differentiation-dependent expression of a secreted reporter protein; in 
1 0 particular secreted embryonic alkaline phosphatase (SEAP). 
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